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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re Application of: 

Stewart COLE et al. 

Serial No.: 09/936,523 

Filed: September 14, 2001 

For: DELETED SEQUENCES IN M. 
BOVIS BCG/M.BOVIS OR M. 
TUBERCULOSIS, METHOD FOR 
DETECTING MYCOBACTERIA 
USING THESE SEQUENCES AND 
VACCINES 



LAW OFFICES 

FiNNEGAN, Henderson, 

FARABOW, GARRETT; 

S DUNNER,LL.P. 
1300 I STREET^ N. W. 
WASHINGTON, DC 30005 
202-408-4000 



Commissioner of Patents and Trademarks 
Wasliington, DC 20231 



Sir: 



PRELIMINARY AMENDMENT 



Prior to the examination of the above application, please amend this application 
as follows: 

IN THE CLAIMS: 



Please cancel claims 7 and 24. 

Please amend the following claims: 
1 . (Amended) A nucleotide or polynucleotide sequence deleted from the genome of 
M. bovis BCG/M. bovis and present in the genome of M. tuberculosis or a nucleotide or 
polynucleotide sequence of the following ORFs and genes: Rv2346c, Rv2347c, 
Rv2348c, p/cC, p/cB, pIcA, Rv2352c, Rv2353c, Rv3425, Rv3426, Rv3427c, Rv3428c, 
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Rv1964, Rv1965, mce3, Rv1967, Rv1968, Rv1969, IprM, Rv1971, Rv1972, Rv1973, 

' Rv1 974, Rv1 975, Rv1 976c, Rv1 977, ephA, Rv361 8, Rv361 9c, Rv3620c, Rv3621 c, 

Rv3622c, IPqG, cobL, Rv2073c, Rv2074, Rv2075, echAI, Rv0223c, RvD1-0RF1, RvD1- 

0RF2, Rv2024c, pIcD, RvD2-0RF1, RvD2-ORF2, RvD2-ORF3, or Rv1758. 

2. (Amended) The nucleotide or polynucleotide sequences as claimed in claim 1 
grouped together in nucleotide regions RD5 to RD10 and RvDI and RvD2 according to 
the following distribution: 

□ (A) RD5: Rv2346c, Rv2347c, Rv2348c, pIcC, pIcB, pIcA, Rv2352c, Rv2353c; 

W (B) RD6: Rv3425, Rv3426, Rv3427c, Rv3428c; 

(C) RD7: Rv1964, Rv1965, mce3, Rv1967, Rv1968, Rv1969, IprM, Rv1971, 
U Rv1 972, Rv1 973, Rv1 974, Rv1 975, Rv1 976c, Rv1 977; 

2 (D) RD8: ephA, Rv3618, Rv3619c, Rv3620c, Rv3621c, Rv3622c, IpqG; 

^' (E) RD9: cojbL, Rv2073c, Rv2074, Rv2075c; 

(F) RD10: ecM/, Rv0223c; 

(G) RvD1: RvD1-0RF1, RvD1-0RF2, Rv2024c; and 

(H) RvD2: pIcD, RvD2-0RF1, RvD2-ORF2, RvD2-ORF3, Rv1758. 

uAw orp,c^s 3 (Amended) A method for the discriminatory detection and identification of M. 

FiNNEGAN, Henderson, 
Farabow, Garrett, 

,3®o?i sTRE^i^, H.W. bovis BCG/M. bovis or M. tuberculosis in a biological sample, comprising: 

WASHINGTON, DC £0005 
2O2-4O8-4000 
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(A) isolating the DNA from the biological sample to be analyzed or production of a 
cDNA from the RNA of the biological sample; 

(B) detecting the DNA sequences of the mycobacterium present in said biological 
sample; and 

(C) analyzing said sequences with the nucleotide and polynucleotide sequences as 
claimed in claim 1 . 

4. (Amended) The method as claimed in claim 3, wherein the detection of the 
mycobacterial DNA sequences Is carried out using nucleotide sequences 
complementary to said DNA sequences. 

5. (Amended) The method as claimed in claim 3, wherein the detection of the 
mycobacterial DNA sequences is carried out by amplifying the sequences using 
primers. 

6. (Amended) The method as claimed in claim 5, wherein the primers have a 
nucleotide sequence chosen from the group comprising SEQ ID No. 1, SEQ ID No. 2, 
SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5, SEQ ID No. 6, SEQ ID No. 7, SEQ ID No. 
8, SEQ ID No. 9, SEQ ID No. 10, SEQ ID No. 11, SEQ ID No. 12, SEQ ID No. 13, SEQ 
ID No. 14, SEQ ID No. 15, SEQ ID No. 16, SEQ ID No. 17, and SEQ ID No. 18 wherein: 

(A) the pair SEQ ID No. 1/SEQ ID No. 2 is specific for RD4; 

(B) the pair SEQ ID No. 3/SEQ ID No. 4 is specific for RD5; 
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(C) the pair SEQ ID No. 5/SEQ ID No. 6 is specific for RD6; 

(D) the pair SEQ ID No. 7/SEQ ID No. 8 is specific for RD7; 

(E) the pair SEQ ID No. 9/SEQ ID No. 10 is specific for RD8; 

(F) the pair SEQ ID No. 1 1/SEQ ID No. 12 is specific for RD9; 

(G) the pair SEQ ID No. 13/SEQ ID No. 14 is specific for RD10; 

(H) the pair SEQ ID No. 1 5/SEQ ID No. 16 is specific for RvDI ; and 

(I) the pair SEQ ID No. 1 7/SEQ ID No. 18 is specific for RvD2. 

8. (Amended) A method for the discriminatory detection and identification of M. 
bovis BCG/7W. bovis or M. tuberculosis in a biological sample, comprising: 

(A) bringing the biological sample to be analyzed into contact with at least one pair of 
primers as defined in claim 6, the DNA contained in the sample having been, where 
appropriate, made accessible to the hybridization beforehand; 

(B) amplifying the DNA of the mycobacterium; and 

(C) visualizing the amplification of the DNA fragments. 

9. (Amended) A kit for the discriminatory detection and identification of M. bovis 
BCG/M. bovis or M. tuberculosis in a biological sample comprising: 

(A) at least one pair of primers as defined in claim 6; 
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(B) reagents necessary to carry out a DNA amplification reaction; and 

(C) optionally, the necessary components, which make it possible to verify or compare 
the sequence, the size of the amplified fragment, or both the sequence and the size of 
the amplified fragment. 

1 0. (Amended) A method of amplifying a DNA sequence from M. bovis BCG/M. bovis 
or M. tuberculosis comprising hybridizing at least one of the pair of primers of claim 6 to 
the DNA sequence. 

1 1 . (Amended) A product of expression of all or part of a nucleotide or polynucleotide 
sequence deleted from the genome of M. bovis BCG/M. bovis and present in M. 
tuberculosis or a product of expression of all or a part of an ORF or gene of claim 1 . 

12. (Amended) A method for the discriminatory detection in vitro of antibodies 
directed against M. bovis BCG/M bovis or M. tuberculosis in a biological sample, 
comprising: 

(A) bringing the biological sample into contact with at least one product as defined in 
claim 1 1 , and 

(B) detecting the antigen-antibody complex formed. 

1 3. (Amended) A method for the discriminatory detection of a vaccination with M. 
bovis BCG or an infection by M. tuberculosis in a mammal, comprising: 

(A) preparing a biological sample containing cells. 
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(B) incubating the biological sample with at least one product as defined in claim 1 1 , 
and 

(C) detecting a cellular reaction indicating prior sensitization of the mammal to said 
product, wherein the cellular reaction is cell proliferation, synthesis of proteins, or both 
cell proliferation and synthesis of proteins such as gamma interferon. 

14. (Amended) A kit for the in vitro diagnosis of an M. tuberculosis infection in a 
mammal optionally vaccinated beforehand with M. bovis BCG comprising: 

(A) a product as defined in claim 1 1 ; 

(B) where appropriate, reagents for the constitution of the medium suitable for the 
immunological reaction; 

(C) reagents allowing the detection of the antigen-antibody complexes produced by 
the immunological reaction; 

(D) where appropriate, a reference biological sample (negative control) free of 
antibodies recognized by said product; and 

(E) where appropriate, a reference biological sample (positive control) containing a 
predetermined quantity of antibodies recognized by said product. 

1 5. (Amended) A mono- or polyclonal antibody, or its chimeric fragments or 
antibodies, wherein the antibodies or fragments are capable of specifically recognizing a 
product as defined in claim 1 1 . 

-6- 
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1 6. (Amended) A method for the discriminatory detection of the presence of an 
antigen of M. bovis BCG/ M. bovis or M. tuberculosis in a biological sample comprising: 

(A) bringing the biological sample into contact with an antibody as claimed in claim 
15; and 

(B) detecting the antigen-antibody complex formed. 

1 7. (Amended) A kit for the discriminatory detection of the presence of an antigen of 
M. bovis BCG/M. bovis or M. tuberculosis in a biological sample comprising: 

(A) an antibody as claimed in claim 15; 

(B) reagents for constituting the medium suitable for the immunological reaction; and 

(C) reagents allowing the detection of the antigen-antibody complexes produced by 
the immunological reaction. 

1 8. (Amended) An immunological composition, comprising at least one product as 
defined in claim 1 1 , and a phannaceutically compatible vehicle. 

1 9. (Amended) The vaccine of claim 1 8, further comprising one or more immunity 
adjuvants. 

20. (Amended) A method for the discriminatory detection and identification of M. 
bovis BCG or M. tuberculosis in a biological sample comprising the following steps: 
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(A) digesting with Hind\\\, of at least part of the genome of the mycobacterium 
present in a biological sample to be analyzed; and 

(B) analyzing restriction fragments thus obtained. 

21 . (Amended) The method as claimed in claim 20, wherein the analysis of the 
restriction fragments comprises counting said fragments, detennining the length of said 
fragments, or both counting said fragments and determining the length of said 
fragments. 

22. (Amended) The method of detection as claimed in claim 20, wherein the analysis 
of the restriction fragments comprises bringing the fragments into contact with at least 
one probe under stringent hybridization conditions and identifying the fragments 
hybridized. 

23. (Amended) The method as claimed in claim 22, wherein the probe is obtained by 
amplification of the genomic DNA with primers chosen from the group SEQ ID No. 31 , 
SEQ ID No. 32, SEQ ID No. 33, or SEQ ID No. 34 with the pair: 

(A) SEQ ID No. 31/SEQ ID No. 32 specific for DU1; or 

(B) SEQ ID No. 33/SEQ ID No. 34 specific for DU2. 

25. (Amended) The method as claimed in claim 20, wherein the analysis of the 
fragments obtained comprises amplification with primers and sequencing, wherein the 
primers are chosen from the group SEQ ID No. 19, SEQ ID No. 20, SEQ ID No. 21, 
SEQ ID No. 22, SEQ ID No. 23, SEQ ID No. 24, SEQ ID No. 25, SEQ ID No. 26, SEQ 
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ID No. 27, SEQ ID No. 28, SEQ ID No. 35, SEQ ID No. 36, SEQ ID No. 37, and SEQ ID 

No. 38, wherein 

(A) SEQ ID No. 19, SEQ ID No. 20/SEQ ID No. 21 are specific for JDU1; 

(B) SEQ ID No. 22, SEQ ID No. 24/SEQ ID No. 23, SEQ ID No. 25 are specific 
for JDU2A; 

(C) SEQ ID No. 26/SEQ ID No. 27, SEQ ID No. 28 are specific for JDU2B 

(D) SEQ ID No. 36, SEQ ID NO. 37, SEQ ID No. 38 are specific for DU1. 



LAW OFFICES 

finnegan, henderson, 
Farabow, Garrett, 

& DUNNER,LL.P. 

I3O0 I STREET, N. W. 
WASHINGTON, DC 20OO5 
202-406-4000 



Please add the following new claims: 

25. (NEW) The method of claim 13, wherein the biological sample containing 
cells is a sample of cells of the immune system. 

26. (NEW) The method of claim 25, wherein the cells of the immune system 
are T cells. 

27. (NEW) The method of claim 1 3, wherein the cellular reaction detected is 
synthesis of gamma-interferon. 

REMARKS 

Entry of this Amendment prior to examination is respectfully requested. The 
amendments to the claims were made to conform with United States patent practice. 
They do not add new matter to the claims. 
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The amendment of Claims 21 and 22 changes the description of analysis 
methods from "consists" to "comprises." Support for these changes can be found, for 
example, on page 20 of the specification. Here, methods of analysis are described in 
terms that contemplate the methods delineated in the claims, as well as other methods. 
Specifically, the specification reads, "As regards the analysis of restriction fragments, it 
may consist in counting said fragments and/or in determining their length." 
(Specification page 20, lines 4-6, emphasis added.) Furthermore, the next paragraph 
begins with the statement, "Another way of analyzing the restriction fragments resulting 
from the enzymatic digestion of the genome of the mycobacterium as described above 
consists in bringing said fragments into contact with at least one appropriate probe, 
covering for example the duplicated region, under hybridization conditions so as to then 
identify the number and size of the fragments which have hybridized." (Specification 
page 20, lines 19-21, emphasis added.) These statements demonstrate that analysis of 
the fragments can include various techniques and methods. 

If there is any fee due in connection with the filing of this Preliminary 
Amendment, please charge the fee to our Deposit Account No. 06-0916. 



Respectfully submitted, 



FINNEGAN, HENDERSON, FARABOW, 
GARRETT & DUNNER, L.L.P. 



FINNEGAN, Henderson, 
Farabow, Garrett, 

S DUNNER,L.L.P. 

130 0 I STREET, N. W. 



Dated: November 20, 2001 




WASHINGTON, DC 20005 
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Appendix to Amendment of November 20. 2001 

Please amend the claims as follows: 

1 . (Amended) A [Nucleotide] nucleotide or polynucleotide [sequences] sequence 
deleted from the genome of M. bovis BCG/M. bovis and present in the genome of M. 
tuberculosis or [conversely chosen from] a nucleotide or polynucleotide sequence of the 
following ORFs and genes: Rv2346c, Rv2347c, Rv2348c, pIcC, pIcB, pIcA, Rv2352c, 
Rv2353c, Rv3425, Rv3426, Rv3427c, Rv3428c, Rv1964, Rv1965, mce3, Rv1967, 
Rv1968, Rv1969, IprM, Rv1971, Rv1972, Rv1973, Rv1974, Rv1975, Rv1976c, Rv1977, 
ephA, Rv3618, Rv3619c, Rv3620c, Rv3621c, Rv3622c, IPqG, cobL, Rv2073c, Rv2074, 
Rv2075, echAI, Rv0223c, RvDI-ORFI, RvD1-0RF2, Rv2024c, pIcD, RvD2-0RF1, 
RvD2-ORF2, RvD2-ORF3, or Rv1758. 

2. (Amended) The nucleotide or polynucleotide sequences as claimed in claim 1 
grouped together in nucleotide regions RD5 to RD10 and RvDI and RvD2 according to 
the following distribution: 

[-] lA) RD5: Rv2346c, Rv2347c, Rv2348c, p/cC, pIcB, pIcA, Rv2352c, Rv2353c[,Ji 
[-] {B1RD6: Rv3425, Rv3426, Rv3427c, Rv3428c[,]i 

[-] (C1RD7: Rvl 964, Rvl 965, /77ce3, Rvl 967, Rvl 968, Rv1 969, /prM, Rvl 971, 
Rv1972, Rv1973, Rvl 974, Rv1975, Rvl 976c, Rv1977[,]: 

[-] (01 RD8: ephA, Rv3618, Rv3619c, Rv3620c, Rv3621c, Rv3622c, lpqG[,l 
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[-] (El RD9: cobL, Rv2073c, Rv2074, Rv2075c[,]: 

[-] lORDIO: ecM/, Rv0223c[,]; 

[-] (Gl RvDI: RvD1-0RF1, RvD1-0RF2, Rv2024 c: and 

[-] (H) RvD2: p/cD, RvD2-0RF1, RvD2-ORF2, RvD2-ORF3, Rv1758. 

3. (Amended) A method for the discriminatory detection and identification of M. 
bovis BCG/M bovis or M. tuberculosis in a biological sample, comprising [the following 
steps]: 

ra)KA) [isolation of] isolating the DNA from the biological sample to be analyzed or 
production of a cDNA from the RNA of the biological sample[,]: 

[b) ]{B) [detection of] detecting the DNA sequences of the mycobacterium present in said 
biological sample[.] : and 

[c) ](Cl [analysis of] analyzing said sequences with the nucleotide and polynucleotide 
sequences as claimed in claim 1 [or 2]. 

4. (Amended) The method as claimed in claim 3, [in which] wherein the detection of 
the mycobacterial DNA sequences is carried out using nucleotide sequences 
complementary to said DNA sequences. 

5. (Amended) The method as claimed in claim 3Jor 4, in which] wherein the 
detection of the mycobacterial DNA sequences is carried out by [amplification of these] 
amplifying the sequences using primers. 
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6. (Amended) The method as claimed in claim 5, [in which] wherein the primers 

have a nucleotide sequence chosen from the group comprising SEQ ID No. 1, SEQ ID 

No. 2. SEQ ID No. 3, SEQ ID No. 4, SEQ ID No. 5, SEQ ID No. 6, SEQ ID No. 7, SEQ 
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ID No. 8, SEQ ID No. 9, SEQ ID No. 10, SEQ ID No. 11, SEQ ID No. 12, SEQ ID No. 
13, SEQ ID No. 14, SEQ ID No. 15, SEQ ID No. 16, SEQ ID No. 17, and SEQ ID No. 18 
[with] wherein : 

(A) the pair SEQ ID No. 1/SEQ ID No. 2 is specific for RD4: 

(B) the pair SEQ ID No. 3/SEQ ID No. 4 is specific for RD5[,]i 
(CI the pair SEQ ID No. 5/SEQ ID No. 6 js specific for RD6[,]i 
£01 the pair SEQ ID No. 7/SEQ ID No. 8 is specific for RD7[,]i 

[-] £E1 the pair SEQ ID No. 9/SEQ ID No. 1 0 is specific for RD8[,]: 

(O the pair SEQ ID No. 11 /SEQ ID No. 12 is specific for RD9[,]i 

[-] (Glthe pair SEQ ID No. 1 3/SEQ ID No. 14 is specific for RD10[,]: 

(HI the pair SEQ ID No. 1 5/SEQ ID No. 1 6 js specific for RvD1[,]i and 

(11 the pair SEQ ID No. 1 7/SEQ ID No. 18 ]s specific for RvD2[,L 

8. (Amended) A method for the discriminatory detection and identification of M. 
bovis BCG/7W. bovis or M. tuberculosis in a biological sample, comprising [the following 
steps]: 
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[a)1(A) bringing the biological sample to be analyzed into contact with at least one pair of 

primers as defined in claim 6 [or 7], the DNA contained in the sample having been, 

where appropriate, made accessible to the hybridization beforehand[,]; 

rb)1(B) [amplification of] amplifvinq the DNA of the mycobacterium[,]i_and 

fc)1(C) [visualization of] visualizing the amplification of the DNA fragments. 

9. (Amended) A kit for the discriminatory detection and identification of M. bovis 
BCG/M. bovis or M. tuberculosis in a biological sample comprising [the following 
elements]: 

[a) ] (A) at least one pair of primers as defined in claim 6 [or 7,]; 

[b) ](B)[the] reagents necessary to carry out a DNA amplification reaction[, ]: and 

[c) KC) optionally, the necessary components^ which make it possible to verify or 
compare the sequence [and/or]^ the size of the amplified fragment , or both the 
sequence and the size of the amplified fragment . 



10. (Amended) [The use of at least one pair of primers as defined in claim 6 or 7 for 
the amplification of] A method of amplifying a DNA sequence from M. bows BCG//W. 
bovis or M. tuberculosis comprising hybridizing at least one of the pair of primers of 
claim 6 to the DNA seguence . 



1 1 . (Amended) A product of expression of all or part of [the] a nucleotide or 
polynucleotide [sequences] sequence deleted from the genome of M, bovis BCG/M. 
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bovis and present in M. tuberculosis or [conversely as defined in] a product of 

expression of all or a part of an ORF or gene of claim 1 . 



12. (Amended) A method for the discriminatory detection in vitro of antibodies 
directed against M. bovis BCG/M. bovis or M. tuberculosis in a biological sample, 
comprising [the following steps]: 

ra)l(A) bringing the biological sample into contact with at least one product as defined in 
claim 1 1 , and 

[b)](Bl detecting the antigen-antibody complex formed. 

1 3. (Amended) A method for the discriminatory detection of a vaccination with M. 
bovis BCG or an infection by M. tuberculosis in a mammal, comprising [the following 
steps]: 

[a) ](Al [preparation of] preparing a biological sample containing cells[, more particularly 
cells of the immune system of said mammal and more particularly still T cells], 

[b) ](B) [incubation of] incubating the biological sample [of step a)] with at least one 
product as defined in claim 1 1 , and 

[c) ](Cl [detection of] detecting a cellular reaction indicating prior sensitization of the 
mammal to said product, [in particular] wherein the cellular reaction is cell proliferation 
[and/or]^ synthesis of proteins [such as gamma-interferoni , or both cell proliferation and 
synthesis of proteins such as gamma interferon . 
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14. (Amended) A kit for the in vitro diagnosis of an M. tuberculosis infection in a 
mammal optionally vaccinated beforehand with M. bovis BCG comprising: 

[a) 1(A) a product as defined in claim 11[,]; 

[b) ](B) where appropriate, [the] reagents for the constitution of the medium suitable for 
the immunological reactionf,]; 

fc)1(C) [the] reagents allowing the detection of the antigen-antibody complexes produced 
by the immunological reaction[,]; 

[d) ](Dl where appropriate, a reference biological sample (negative control) free of 
antibodies recognized by said productf,] ; and 

[e) ]£El where appropriate, a reference biological sample (positive control) containing a 
predetermined quantity of antibodies recognized by said product. 

1 5. (Amended) A mono- or polyclonal antibody, or its chimeric fragments or 
antibodies, [characterized in that they are] wherein the antibodies or fragments are 
capable of specifically recognizing a product as defined in claim 1 1 . 

16. (Amended) A method for the discriminatory detection of the presence of an 
antigen of M. bovis BCG/ M. bows or M. tuberculosis in a biological sample comprising 
[the following steps]: 

[a)1(A) bringing the biological sample into contact with an antibody as claimed in claim 
15ri ; and 
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[b) ]([Bi detecting the antigen-antibody complex formed. 

1 7. (Amended) A kit for the discriminatory detection of the presence of an antigen of 
M. bovis BCG/M. bovis or M. tuberculosis in a biological sample comprising [the 
following steps]: 

ra) l(A) an antibody as claimed in claim 15[,]i 

rb) 1(B) [the] reagents for constituting the medium suitable for the immunological 
reaction[.1 : and 

[c) ]{C)[the] reagents allowing the detection of the antigen-antibody complexes produced 
by the immunological reaction. 

1 8. (Amended) An immunological composition, [characterized in that it comprises] 
comprising at least one product as defined in claim 1 1 . and a pharmaceuticallv 
compatible vehicle . 



1 9. (Amended) [A] Jhe vaccine of claim 18 . [characterized in that it comprises] 
further comprising [at least one product as defined in claim 1 1 in combination with a 
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pharmaceutically compatible vehicle and, where appropriate,] one or more [appropriate] 
immunity adjuvants. 

20. (Amended) A method for the discriminatory detection and identification of M. 
bovis BCG or M. tuberculosis in a biological sample comprising the following steps: 
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[-] (Al [digestion] digesting with H/ndlll, of at least part of the genome of the 

mycobacterium present in a biological sample to be analyzed[,] ; and 

[-] (B) [analysis of the] analyzing restriction fragments thus obtained. 

21 . (Amended) The method as claimed in claim 20, [in which] wherein the analysis of 
the restriction fragments [consists in] comprises counting said fragments [and/or in]^. 
determining [their] the length of said fragments, or both counting said fragments and 
determining the length of said fragments . 

22. (Amended) The method [Method] of detection as claimed in [either of claims] 
claim 20^ [and 21 , in which] wherein the analysis of the restriction fragments [consists 
in] comprises bringing [them] the fragments into contact with at least one probe under 
stringent hybridization conditions and [in] identifying the [fragment parts or fragment] 
fragments hybridized. 



23. (Amended) [A] Ihe method as claimed in claim 22, wherein [characterized in 
that] the probe is obtained by amplification of the genomic DNA with primers chosen 
from the group SEQ ID No. 31, SEQ ID No. 32, SEQ ID No. 33^ or SEQ ID No. 34 with 
the pair: 

[-] (A) SEQ ID No. 31/SEQ ID No. 32 specific for DUIior 
[-] (Bl SEQ ID No. 33/SEQ ID No. 34 specific for DU2. 

25. (Amended) The method as claimed in claim 20, [characterized in that] wherein 
the analysis of the fragments obtained comprises [are amplified] amplification with 
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primers and sequencing, wherein the primers are chosen from the group SEQ ID No. 

19. SEQ ID No. 20. SEQ ID No. 21. SEQ ID No. 22. SEQ ID No. 23. SEQ ID No. 24. 

SEQ ID No. 25. SEQ ID No. 26. SEQ ID No. 27. SEQ ID No. 28, SEQ ID No. 35, SEQ 

ID No. 36, SEQ ID No. 37^ and SEQ ID No. 38 . wherein 

(A) SEQ ID No. 19. SEQ ID No. 20/SEQ ID No. 21 are specific for JDU1: 

(B) SEQ ID No. 22. SEQ ID No. 24/SEQ ID No. 23. SEQ ID No. 25 are specific 
for JDU2A: 

(C) SEQ ID No. 26/SEQ ID No. 27. SEQ ID No. 28 are specific for JDU2B 

(D) SEQ ID No. 36. SEQ ID NO. 37, SEQ ID No. 38 are specific for DU1 . 
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DE LETED SEQUENCES IN^ BOVIS BCG/M> BOVIS OR M. 
TTrngRCTTt^STS , METHOD PQR DETECTI NG MYCOBACTERIA USING 
THESE SEQUENCES AND VACCINES 



5 The subject of the present invention is the 
identification of nucleotide sequences which make it 
possible in particular to distinguish, in diagnostic 
terms, an immunization resulting from a- BCG vaccination 
from an M, tuberculosis infection. The sequences in 

10 question are specific either to M. Jbovis BCG/M. hovis, 
or to M, tuberculosis . The subject of the present 
invention is also a method for detecting the sequences 
in question, a method for detecting antibodies 
generated by the products of expression of these 

15 sequences and the kits for carrying out these methods. 
Finally, the subject of the present invention is novel 
vaccines . 



The high rate of mortality and morbidity caused by 

2 0 Mycobacterium tuberculosis , the etiological agent for 
tuberculosis, brings about the need to develop novel 
vaccines and ever shorter chemotherapeutic treatments. 
Indeed, the appearance of M. tuberculosis strains 
resistant to antituberculars and the increased risk in 

25 immunosuppressed patients, for example in AIDS 
patients, of developing tuberculosis, necessitates the 
development of rapid, specific and reliable methods for 
the diagnosis of tuberculosis and the development of 
novel vaccines. The conventional BCG vaccine is derived 

30 from a Mycobacterium bovis strain which was attenuated 
by repeated serial passages on bile potato-glycerinox 
agar (Calmette, 1927; Bloom and Fine, 1994) . However, 
in spite of almost 50 years of worldwide use, the 
reason for the attenuation of M. Jbovis BCG is still 

35 unknown. Questions remain as regards the protection 
conferred by the vaccine against pulmonary 
tuberculosis, with an efficacy of between 0 and 80% 
(Fine, 1994) , Furthermore, many BCG substrains exist 
and offer various levels of protection against 



tuberculosis in a mouse model (Lagranderie et al . , 
1996) . The attenuation of the original M. bovis strain 
may have been caused by mutations in the genome of the 
bacillus which were selected during serial passages of 
the strain, which mutations remained stable in the 
genome. However, as the original M. bovis strain has 
been lost, direct comparison between it and M. bovis 
BCG is impossible. In spite of that, the identification 
of genetic differences between M. bovis, M. bovis BCG 
and M, tuberculosis is likely to reveal locations whose 
alteration may have led to the attenuation of M. bovis 
BCG. 

The M, tuberculosis DNA has more than 99.9% 
homology with the DNA of the other members of the 
tuberculous complex (M. bovis, M. microtis, 

M. africanum) . Although closely related, these strains 
may be differentiated on the basis of their host range, 
their virulence for humans and their physiological 
characteristics (Heifets and Good, 1994) . As in the 
case of the attenuation of BCG, the genetic base for 
the phenotypic differences between the tubercle bacilli 
is mainly unknown. However, the wealth of information 
contained in the genomic sequence of M. tuberculosis 
H37RV led to the thought that the genetic variations 
between the strains was going to be revealed (Cole et 
al., 1998). Genomic comparison presents a powerful tool 
for such research studies since the whole genomes may 
be studied in preference to the study of genes in their 
individual forms. A previous comparative study of 
M. bovis and M. Jbovis BCG by substractive genomic 
hybridization has shown that three regions, designated 
RDl, RD2 and RD3 , were deleted in M. Jbovis BCG compared 
to M. jbovis (Mahairas et al . , 1996). However, the role, 
where appropriate, of these regions in the attenuation 
of M. bovis BCG has not been clearly established. 
Similarly, other studies of genomic differences between 
M. bovis r M. bovis BCG and M. tuberculosis have shown 
that many polymorphic locations existed between these 
strains (Philipp et al . , 1996). Although the exact 
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nature of these polymorphisms has not been elucidated, 
additional analyses have revealed that a polymorphism 
was due to the deletion of 12.7 kb in M. jbovis and BCG 
compared to M. tuberculosis (Brosch et al . , 1998). From 
5 that, it appears that there are two classes of 
deletion: those which are absent from BCG but present 
in M, jbovis and M. tuberculosis and those which are 
absent from M. jbovis and BCG but present in 
M, tuberculosis . 

10 

The bacterial artificial chromosome (BAC) library for 
M. tuberculosis H37Rv deposited at the CMCM under No. 
1-1945 on November 19, 1997 and described in 
application W09954487 demonstrates complete knowledge 

15 of the genomic sequence of M. tuberculosis and presents 
a potential as a tool for postgenomic applications such 
as genomic comparisons (Brosch et al , , 1998). To push 
the investigations into the genomic differences between 
M. tuberculosis and M. bovis BCG even further, the 

20 inventors prepared a BAC library from M. bovis BCG 
deposited on June 30, 1998 at the CNCM under No, 1-2049 
and described in application W09954487, This type of 
library indeed has certain advantages. Firstly, the BAC 
system can maintain large inserts of mycobacterial DNA, 

25 up to 120 kb. The 4.36 Mb of M. bovis BCG genome could 
therefore be represented in 50 to 60 clones, 
simplifying the storage and handling of the library. 
Secondly, the BAC system can allow, in complete 
confidence, replication of the inserts without 

3 0 genericing rearrangement or deletion in the clones. 
From that, alterations of the insert cannot be at the 
origin of an error for the duration in the genome. 
Thirdly, the positioning of the BAC clones on the 
M. jbovis BCG chromosome is likely to generate a map of 

3 5 clones which overlap, which ought to allow direct 
comparison of the local segments on the M. tuberculosis 
and M, Jbovis BCG genome, while being a resource of 
interest for the sequencing of the M, bovis BCG genome. 
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The construction of a BAC library for M. bovis BCG- 
Pasteur (1-2049) is described below as well as its use, 
in conjunction with the BAC library for M. tuberculosis 
H37RV (1-1945) , as a tool for genomic comparison. With 
5 this approach, the inventors have been able to identify 
novel deletions and insertions between the tubercle 
bacilli, which makes it possible to have a picture in 
two genomes of the dynamics and differentiation in the 
M. tuberculosis complex. 

10 

The main route for extracting biological information 
from the genome is the comparison between the genomes. 
The technology of biochips or ''"DNA chips'' (Chee et al., 
1996; DeRisi et al , , 1997) described, for example, in 

15 patents No. WO97/02357 and No. W097/2 9212 makes it 
possible to make alignments and to select the sequences 
of interest. However, the availability of a minimum set 
of BAC clones for the genomes of M, bovis BCG and 
M. tuberculosis H37Rv has offered the inventors ready- 

2 0 to-use tools for the abovementioned comparative 
studies. The BAC library for M. jbovis BCG contains more 
than 1500 clones with an average size of inserts of 
about 75 kb. 57 clones cover the BCG genome including a 
Hindu I fragment of 12 0 kb which was absent from the 

25 M. tuberculosis BAC library. The construction of BAC 
chips from the M. bovis BCG library should allow the 
inventors to extend their comparative studies relating 
to the tubercle bacillus. These fragments can be 
hybridized with the genomic DNA from clinical isolates 

30 from M, tuberculosis or epidemic strains in order to 
identify other deletions or rearrangements, and from 
that, allow a novel picture relating to the plasticity 
of the genome as well as the identification of the 
genes and the gene products which may be involved in 

35 the virulence. 



At the end of the experiments reported here, the 
inventors identified 10 locations or loci which are 
absent from M\ bovis BCG compared to M. tuberculosis . 
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Hybridizations with the genomic DNA of M. bovis 
revealed that 7 of these loci were also deleted in 
M, bovis compared to M. tuberculosis , Thus, in the text 
below, every time reference is made to the 
5 characteristics common to the genome of M. Jbovis BCG 
and to that of M. jbovis it will be indicated that this 
means the ^^genome of M* bovis BCG/M, bovis''. 

It was then found that 3 of the specific deletions 
10 which appeared in M. bovis BCG were identical to the 
RDl, RD2 and RD3 regions defined by the Stover team 
(Mahairas et al . , 1996). Thus, by retaining the 
preceding nomenclature the inventors called the other 
7 deletions of the M. Jbovis BCG/M. bovis genome, RD4 , 
15 RD5, RD6, RD7, RD8 , RD9 and RDIO . 

Other deletions have been found to be specific to the 
M, tuberculosis genome, it being understood that the 
'"corresponding" sequences were present in M. Jbovis 
20 BCG/M. bovis; they were called RvDl and RvD2 (tables 1 
and 2) . 

The RD5-RD10, RvDl and RvD2 deletions allowed the 
inventors to identify thoroughly the dynamics of the 

25 genome in the tubercle bacillus and gave information 
relating to the genetic bases of the phenotypic 
differentiation of the complex. The identification of 
RvDl and RvD2 as deletions of the M. tuberculosis H37Rv 
genome shows that the deletion process does not 

3 0 function in a single direction, and the loss of 
information can therefore occur both in bovine strains 
and in human strains. It is observed that 8 of the 
10 deletions detected are located in a region of the 
chromosome where termination of replication probably 

3 5 occurs. 

The inventors then, within each deleted region, 
identified several ORFs (or open reading frames) or 
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genes and they tried to determine the putative function 
of each of them (table 1) . 

The subject of the present invention is therefore 
5 nucleotide sequences deleted from the genome of 
M. bovis BCG/M. bovis and present in the genome of 
M. tuberculosis or conversely chosen from the following 
ORFs and genes: Rv2346c, Rv2347c, Rv2348c, plcC, plcB, 
plcA, RV2352C, Rv2353c, Rv3425, Rv3426, Rv3427c, 

10 RV3428C, RV1964, Rvl965, mce3 , Rvl967, Rvl968, Rvl969, 
IprM, RV1971, Rvl972, Rvl973, Rvl974, Rvl975, Rvl976c, 
RV1977, ephA, Rv3618, Rv3619c, Rv3620c, Rv3621c, 
RV3622C, IpgG, cobh, Rv2073c, Rv2074, Rv2075, echAI, 
RV0223C, RvDl-ORFl, RvDl-0RF2, Rv2024c, plcD, RvD2- 

15 ORFl, RVD2-ORF2, RvD2-ORF3, Rvl758 . 

The expression "nucleotide sequence" according to the 
present invention is understood to mean a double- 
stranded DNA, a single -stranded DNA and products of 
20 transcription of said DNAs . 

More particularly, the nucleotide sequences listed 
above are grouped into nucleotide regions according to 
the following distribution: 

25 

RD5:Rv2346c, Rv2347c, Rv2348c, plcC, plcB, plcA, 
RV2352C, Rv2353c, 

RD6: RV3425, Rv3426, Rv3427c, Rv3428c, 
RD7: RV1964, Rvl965, mce3 , Rvl967, Rvl968, Rvl969, 
30 IprM, RV1971, Rvl972, Rvl973, Rvl974, Rvl975, 

RV1976C, Rvl977, 

RD8: ephA, RV3518, Rv3619c, Rv3620c, Rv3621c, 
Rv3622c, IpgG, 

RD9: coJbL, RV2073C, Rv2074, Rv2075, 
35 - RDIO: echAI, Rv0223c, 

RvDl: RvDl-ORFl, RvDl-ORF2, Rv2024c 

RvD2: plcD, RvD2-0RFl, RvD2-ORF2, RvD2-ORF3, 
Rvl758 . 
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Advantageously, 3 of the deletions {RD5, RD6 and RD8) 
contain 6 genes encoding PE and PPE proteins. As it has 
been suggested that these proteins have a possible role 
in antigenic variation (Cole et al . , 1998), it can be 
5 deduced therefrom that these loci may represent sites 
of hypervariability between the tubercle strains. 

At least 9 proteins capable of being exported or 
exposed at the surface are encoded by RD4 to RDIO, 

10 which indicates that these polypeptides perhaps have a 
major role in the immune recognition of the bacillus. 
It has indeed been shown that secreted polypeptides can 
have a potential stimulatory role in the immune system 
and they are capable of playing a role of antigens 

15 known to become involved during the early stage of 
infection (Elhay et al . , 1998; Horwitz et al . , 1995; 
Rosenkrands et al . , 1998). 

The fact that RD5 and RD6 contain genes encoding 
2 0 proteins belonging to the ESAT-6 family, 14 of which 
are organized into 11 distinct loci, is particularly 
significant (F. Tekaia, S. Gordon, T. Garnier, 

R. Brosch, B,G. Barrell and S.T. Cole, submitted). 
ESAT-6 is a major T cell antigen which appears to be 
25 secreted by the virulent tubercle bacillus 
independently of the signal peptide (Harboe et al . , 
199G) . It accumulates in the extracellular medium 
during the early phases of growth and its gene is 
located in RDl, a region which is deleted from the 
30 genome of M. bovis BCG (Mahairas et al . , 1996; Philipp 
et al . , 1996). 3 of the 10 RD regions thus contain 
genes of the ESAT-6 family, which indicates that other 
sites of ESAT-6 genes can also give rise to deletions 
or rearrangement s . 

35 

The genomic sequence of M. tuberculosis H37Rv has 
moreover revealed the presence of 4 highly related 
genes encoding phoapholipase C enzymes called plcA, 
plcB, plcC ^nd plcD (Cole et al . , 1998) c Phospholipase 
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C has been recognized as a major virulence factor in a 
number of bacteria including Clostridum perfringens , 
Listeria monocytogenes and Pseudomonas aeruginosa, where 
it plays an intracellular role in the dissemination of 
5 bacterial cells, in intracellular survival and in 
cytolysis (Titball, 1993) . The RD5 deletion includes 
3 genes (plcA, plcB and plcC) , this region being absent 
from M. Jbovis, M. bovis BCG and M. microti. The 
detection of the phospholipase activity in 

10 M. tuberculosis, M. microti and M. bovis but not in 
M. bovis BCG has been previously described in (Johansen 
et al., 1996; Wheeler and Ratledge, 1992) as well as 
the role of the enzymes encoded by plcA and plcB (also 
known under the name mpcA and mpcB) in the hydrolysis 

15 both of phosphatidylcholine and sphingomyelin. The 
levels of phospholipase C activity which are detected 
in M. bovis are considerably less than those observed 
in M. tuberculosis which are in agreement with the loss 
of plcABC, the sphingomyelinase activity still being 

20 detectable. The sequence data presented here show that 
full-length phospholipase is encoded by the plcD gene 
in M. jbovis BCG-Pasteur and that its considerable 
sequence similarity with the products of plcA and plcB 
indicates that it is probably endowed both with 

25 phospholipase activity and with a sphingomyelinase 
activity. It is therefore probable that plcD may be 
responsible for the residual phospholipase C activity 
in strains exhibiting the RD5 deletion, such as 
M. jbovis, although it is difficult to link this 

30 interpretation to the observed absence of phospholipase 
C in spite of the presence of sphingomyelinase in the 
M. bovis BCG strain used in other studies (Johansen et 
al . , 1996; Wheeler and Raledge, 1992). Studies of 
expression with the cloned plcD gene ought to clarify 

3 5 this point. 



The mce gene has been described by the Riley team as 
encoding a putative protein of M. tuberculosis of the 
invasin . type, whose expression in E. coli allows the 
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invasion of HeLa cells (Arruda et al . , 1993), Three 
other Mce proteins have been identified as part of the 
genome sequencing project with their gene occupying the 
same position in the four large highly conserved 
5 operons comprising at least eight genes (Cole et al . , 
1998; Harboe et al . , 1996). It is difficult to deduce 
the effects of the loss of mce3 (RD7) on M. hovis, 
M. microti and M. Jbovis BCG because of the fact that 
the remaining three copies of mce could complement any 
10 loss of activity, unless the operons are differently 
expressed. However , it is of interest to note that RD7 
is absent from certain members of the M. tuberculosis 
complex which are not virulent for humans, suggesting 
that RD7 can play a specific role in human disease. 

15 

The genome of M. tuberculosis H3 7Rv also encodes six 
proteins (Eph-A-F) which show similarity with epoxide 
hydrolases whereas at least 21 enoyl-CoA hydratases 
(EchAl-21) and multiple aldehyde dehydrogenases are 

20 present (Cole et al . , 1998). The loss of ephA (RD8) , 
echAl and the aldehyde dehydrogenase encoded by Rv0223c 
(RDIO) in M. Jbovis BCG/M. bovis can therefore be 
compensated by other enzymes although the substrate 
specificity of the M. tuberculosis enzymes is unknown. 

25 The epoxide hydrolases are generally considered as 
detoxifying enzymes; a recent report has again showed 
that they play a role in the activation of leukotoxins 
(Moghaddam et al . , 1997), a toxic fatty acid produced 
by the leukocytes which are involved in respiratory 

30 distress syndrome in adults. However, the question of 
knowing if the M. tuJbercuIosris epoxide hydrolases can 
chemically modify host chemokines is without response. 
Alternatively, they can play a role in lipid 
detoxification of the products of peroxidation which 

35 are generated by oxygen radicals from activated 
macrophages . 

RD9 is a region deleted from the genomes of 
M. africanumj, M. bovis, M. bovis BCG and M. microti 



- 10 - 



compared to M. tuberculosis . Consequently, in contrast 
to the other RD regions, the location of M. africanum 
is close to M. jbovis, which indicates the presence of 
this strain between M. tuberculosis and M. Jbovis 
5 (Heifets and Good, 1994) . Similarly, the RD4 region can 
differentiate M. microti from the bovine strains {table 
2) . 



The proteins encoded by RD4 to RDIO can therefore have 
10 antigens of interest, allowing discrimination between 
individuals vaccinated with BCG and patients infected 
with M, tuberculosis . 



Thus, the subject of the present invention is also a 
15 method for the discriminatory detection and 
identification of M. bovis BCG/M. bovis or 
M. tujberculosis in a biological sample, comprising the 
following steps: 



2 0 a) isolation of the DNA from the biological 

sample to be analyzed or production of a 
cDNA from the RNA of the biological 
sample, 

b) detection of the DNA sequences of the 
25 mycobacterium present in said biological 

sample, 

c) analysis of said sequences. 



Preferably, in the context of the present invention, 
3 0 the biological sample consists of a fluid, for example 
human or animal serum, blood, a biopsy, bronchoalveolar 
fluid or pleural fluid. 



Analysis of the desired sequences may, for example, be 
35 carried out by agarose gel electrophoresis. If the 
presence of a DNA fragment migrating to the expected 
site is observed, it can be concluded that the analyzed 
sample contained microbacterial DNA. This analysis can 
also be carried out by the molecular hybridization 
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technique using a nucleic probe. This probe will be 
advantageously labeled with a nonradioactive (cold 
probe) or radioactive element. 

5 Advantageously, the detection of the mycobacterial DNA 
sequences will be carried out using nucleotide 
sequences complementary to said DNA sequences. By way 
of example, they may include labeled or nonlabeled 
nucleotide probes; they may also include primers for 
10 amplification. 

The amplification technique used may be PGR but also 
other alternative techniques such as the SDA {Strand 
Displacement Amplification) technique, the TAS 
15 technique (Transcription-based Amplification System) , 
the NASBA (Nucleic Acid Sequence Based Amplification) 
technique or the TMA (Transcription Mediated 
Amplification) technique . 

2 0 The primers in accordance with the invention have a 

nucleotide sequence chosen from the group comprising 
SEQ ID No. 1, SEQ ID No . 2, SEQ ID No . 3, SEQ ID No. 4, 
SEQ ID No. 5, SEQ ID No . 6, SEQ ID No . 7, SEQ ID No. 8, 
SEQ ID No. 9, SEQ ID No. 10, SEQ ID No. 11, 

25 SEQ ID No. 12, SEQ ID No. 13, SEQ ID No. 14, 

SEQ ID No. 15, SEQ ID No. 16, SEQ ID No. 17, and 
SEQ ID No. 18 with: 

the pair SEQ ID No. 1/SEQ ID No, 2 specific for 

RD4, 

3 0 - the pair SEQ ID No. 3 /SEQ ID No. 4 specific for 

RD5, 

the pair SEQ ID No. 5/SEQ ID No . 6 specific for 

RD6, 

the pair SEQ ID No. 7/SEQ ID No . 8 specific for 

35 RD7, 

the pair SEQ ID No. 9/SEQ ID No, 10 specific 

for RD8, 

the pair SEQ ID No. 11/SEQ ID No. 12 specific 

for RD9, 
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the pair SEQ ID No. 13/SEQ ID No. 14 specific 
for RDIO, 

the pair SEQ ID No, 15/SEQ ID No. 16 specific 
for RvDl, and 

5 - the pair SEQ ID No. 17/SEQ ID No. 18 specific 

for RvD2 , 

In a variant^ the subject of the invention is also a 
method for the discriminatory detection and 
10 identification of M. hovis BCG/M. bovis or 
M. tujberculosis in the biological sample comprising the 
following steps: 

a) bringing the biological sample to be analyzed 
15 into contact with at least one pair of primers 

as defined above, the DNA contained in the 
sample having been, where appropriate, made 
accessible to the hybridization beforehand, 

b) amplification of the DNA of the mycobacterium, 
20 c) visualization of the amplification of the DNA 

fragments . 

The amplified fragments may be identified by agarose or 
polyacrylamide gel electrophoresis by capillary 

2 5 electrophoresis or by a chromatographic technique (gel 

filtration, hydrophobic chromatography or ion-exchange 
chromatography) . The specification of the amplification 
may be controlled by molecular hybridization using 
probes, plasmids containing these sequences or their 
30 product of amplification. 

The amplified nucleotide fragments may be used as 
reagent in hybridization reactions in order to detect 
the presence, in a biological sample, of a target 

3 5 nucleic acid having sequences complementary to those of 

said amplified nucleotide fragments. 



- 13 - 



These probes and amplicons may be labeled or otherwise 
with radioactive elements or with nonradioactive 
molecules such as enzymes or fluorescent elements. 

5 The subject of the present invention is also a kit for 
the discriminatory detection and identification of 
M, bovis BCG/M. Jbovis or M. tuberculosis in a 
biological sample comprising the following components: 

10 a) at least one pair of primers as defined above, 

b) the reagents necessary to carry out a DNA 
amplification reaction, 

c) optionally, the necessary components which make 
it possible to verify or compare the sequence 

15 and/or the size of the amplified fragment. 

Indeed, in the context of the present invention, 
depending on the pair of primers used, it is possible 
to obtain very different results. Thus, the use of 

20 primers which are internal to the deletion, are 
described in the present invention for RD4, RD5 and 
RD8, is such that no amplification product is 
detectable in M. bovis BCG. However, the use of primers 
external to the region of deletion does not necessarily 

25 give the same result, as regards for example the size 
of the amplified fragment, depending on the size of the 
deleted region in M, bovis BCG. Thus, the use of the 
pair of primers SEQ ID No. 5/SEQ ID No. 6 for the 
detection of RD6 is likely to give rise to an amplicon 

3 0 in M. jbovis BCG of about 3 801 bp whereas the use of 
the pair of primers SEQ ID No. ll/SEQ ID No. 12 for the 
detection of RD9 will give rise in bovis BCG to an 
amplicon of about 1 018 bp. 

35 The subject of the invention is also the use of at 
least one pair of primers as defined above for the 
amplification of DNA sequences of M. bovis BCG/M. bovis 
or M, tuberculosis , 
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The benefit of the use of several pairs of primers will 
be quite obviously to cross the results obtained with 
each of them in order to refine the result of the 
analysis. Indeed, when it is indicated, in the context 
of the present invention, that some deletions are 
specific to M, hovis BCG/M. Jbovis, that is not 
completely accurate since some of them are also found 
in M, microti OV254, in M. tuh^rculosis CSU#93 and in 
M. africanum as well as certain clinical isolates 
(table 2) . Thus, the use of the pair of primers 
SEQ ID No. 1/SEQ ID No . 2 specific for the RD4 region 
will not give rise to amplicons of normal size with 
M. jbovis BCG/i\^. Jbovis in the biological sample. On the 
other hand, if the pair of primers used is SEQ ID No. 
5/SEQ ID No. 6 specific for RD6 and that amplicons of 
normal size are not found, it will not be possible, 
from this result only, to discriminate between the 
presence, in the biological sample, of M, bovis 
BCG/M. jbovis, M. microti OV2 54 and M, tuberculosis 
CSU#93. 

The discrimination will be more radical when it will 
involve determining if the mycobacterium present in the 
biological sample to be analyzed is M. bo-^is 
BCG/M. bovis or M. tujberculosis H37Rv because the pairs 
of primers SEQ ID No. 15/SEQ ID No. 16 and SEQ ID No. 
17/SEQ ID No. 18 are specific only for M. tuberculosis 
H37RV. Consequently, the absence of amplicon of normal 
size during the use of either of these pairs of primers 
may be considered as indicative of the presence of 
tuberculosis H37Rv in the biological sample 
analyzed. 

The subject of the present invention is also the 
products of expression of all or part of the nucleotide 
sequences deleted from the genome of M. bovis 
BCG/M. jbovis and present in M. tuberculosis or 
conversely as listed in table 1. 
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The expression ^^product of expression" is understood to 
mean any protein, polypeptide or polypeptide fragment 
resulting from the expression of all or part of the 
abovementioned nucleotide sequences and preferably 
5 exhibiting on least one of the following 
characteristics : 



- capacity to export or secrete by a 
mycobacterium and or be induced or repressed 

10 during infection with mycobacterium, and/or 

- capacity to induce, repress or modulate 
directly or indirectly a mycobacterial 
virulence factor, and/or 

- capacity to induce an immunogenicity reaction 
15 directed against a mycobacterium, and/or 

- capacity to be recognized by an antibody 
specific for a mycobacterium. 



Indeed, the subject of the present invention is also a 
2 0 method for the discriminatory detection in vitro of 
antibodies directed against M. Jbovis BCG/M. jbovls or 
M. tuberculosis in a biological sample, comprising the 
following steps : 



25 a) bringing the biological sample into contact 

with at least one product of expression as 
defined above, 
b) detecting of the antigen-antibody complex 
formed. 

30 

The subject of the invention is also a method for the 
discriminatory detection of a vaccination with M. jbovis 
BCG or an infection by M. tuberculosis in a mammal, 
comprising the following steps: 

35 

a) preparation of a biological sample containing 
cells, more particularly cells of the immune 
system of said mammal and more particularly 
still F cells. 
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b) incubation of the biological sample of step a) 
with at least one product of expression in 
accordance with the present invention, 

c) detection of a cellular reaction indicating 
prior sensitization of the mammal to said 
product, in particular cell proliferation 
and/or synthesis of proteins such as gamma- 
interf eron. 



10 Cell proliferation may be measured, for example, by 
incorporating ^H-Thymidine . 

The invention also relates to a kit for the in vitro 
diagnosis of an M. tuberculosis infection in a mammal 
15 optionally vaccinated beforehand with M. Jbovis BCG 
comprising : 



a) a product of expression in accordance with the 
present invention, 

2 0 b) where appropriate, the reagents for the 

constitution of the medium suitable for the 
immunological reaction, 

c) the reagents allowing the detection of the 
antigen -antibody complexes produced by the 

25 immunological reaction, 

d) where appropriate, a reference biological 
sample (negative control) free of antibodies 
recognized by said product, 

e) where appropriate, a reference biological 

3 0 sample (positive control) containing a 

predetermined quantity of antibodies recognized 
by said product . 



The reagents allowing the detection of the antigen- 
3 5 antibody complexes may carry a marker or may be capable 
of being recognized in turn by a labeled reagent, more 
particularly in the case where the antibody used is not 
labeled . 
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The subject of the invention is also mono- or 
polyclonal antibodies, their chimeric fragments or 
antibodies, capable of specifically recognizing a 
product of expression in accordance with the present 
5 invention . 



The present invention therefore also relates to a 
method for the discriminatory detection of the presence 
of an antigen of M. jbovis BCG/ M. hovis or 
10 M. tuberculosis in a biological sample comprising the 
following steps : 



a) bringing the biological sample into contact 
with an antibody in accordance with the 

15 invention, 

b) detecting the antigen- antibody complex formed. 

The invention also relates to the kit for the 
discriminatory detection of the presence of an antigen 
20 of M. bovis BCG/M. bovis or M, tuberculosis in a 
biological sample comprising the following steps: 



a) an antibody in accordance with the invention, 

b) the reagents for constituting the medium 
25 suitable for the immunological reaction, 

c) the reagents allowing the detection of the 
antigen-antibody complexes produced by the 
immunological reaction . 



3 0 The abovementioned reagents are well known to a person 
skilled in the art who will have no difficulty adapting 
them to the context of the present invention. 

The subject of the invention is also an immunological 
35 composition, characterized in that it comprises at 
least one product of expression in accordance with the 
invention. 
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Advantageously, the immunological composition in 
accordance with the invention enters into the 
composition of a vaccine when it is provided in 
combination with a pharmaceutically acceptable vehicle 
5 and optionally with one or more immunity adjuvant (s) 
such as alum or a representative of the family of 
muramylpeptides or incomplete Freund's adjuvant. 

The invention also relates to a vaccine comprising at 
10 least one product of expression in accordance with the 
invention in combination with a pharmaceutically 
compatible vehicle and, where appropriate, one or more 
appropriate immunity adjuvant (s) . 

15 Standard knowledge on the evolution of the 
M. tuberculosis complex is based on the hypothesis that 
M. tuberculosis is derived from bovis (Sreevatsan et 
al . , 1997). However, a distribution of RDl to RDIO 
among the tuberculous complex suggests that a linear 

2 0 evolution of AT. tuberculosis from M, bovis is too 

simplistic. It appears, indeed, in a more probable 
manner, that the two bacilli are derived from a common 
strain, that the deletions therefore reflect the 
adaptation of the bacilli to their particular niche, 
25 that is to say that the loss of RD4 to RDIO probably 
helped M. bovis to become a more potent pathogenic 
agent for bovines than M. tuberculosis . Functional 
genomic studies will determine which role these 
deletions play in the phenotypic differentiation of the 

3 0 tuberculous complex. 

Finally, the inventors have detected, still by 
comparing the BAG of tuberculosis H37Rv and the BAG 

of M. bovis BGG, two duplications in the genome of 
35 M. Jbovis BCG-Pasteur, called DUl and DU2 . They are 
duplications of regions of several tens of kilobases 
which appear to be absent both from the M. bovis and 
M. tuberculosis H37Rv type strain. The detection of 
these two duplications was made following digestion of 



the same clones for each BAG with Hindlll and analysis 
on a pulsed-f illed electrophoresis gel (PFGE) . These 
observations have been confirmed by hybridization of 
the digested chromosomal DNA derived from M. bovis BCG, 
from the type strain of M. bovis and M. tuberculosis 
H3 7RV with selected probes covering the duplicated 
regions. Primers specific for the rearranged regions 
were prepared and tested on the genomic DNA from 
additional isolates of M. bovis BCG and 
M. tuberculosis . 

It was determined that DUl and DU2 were present in 
three strains of M. bovis BCG including in M. bovis 
BCG-Pasteur and absent from three other substrains of 
M. jbovis BCG. 

These two duplications are also absent from the type 
strain of M, bovis and M, tuberculosis H3 7Rv. 

Thus, still in the context of the present invention in 
relation to the discriminatory detection of M. jbovis or 
M, tuberculosis r the subject of the invention is also a 
method for the discriminatory detection and 
identification of M, bovis BCG or M. tuberculosis in a 
biological sample comprising the following steps: 

digestion, with a restriction enzyme, of at 
least part of the genome of the mycobacterium 
present in a biological sample to be analyzed, 
and 

analysis of the restriction fragments thus 
obtained . 

The digestion with the restriction enzyme may indeed be 
carried out either on the entire genome of the 
mycobacterium, or on one or more clones of the library 
produced from the genome in question. 



Preferably, the restriction enzyme used in the context 
of the abovementioned method is Hindlll . 

As regards the analysis of restriction fragments, it 
may consist in counting said fragments and/or in 
determining their length. Indeed, as is explained 
below, Hindlll digestion of M. jbovis BCG gives rise to 
one fragment more than those obtained after Hindi I I 
digestion of the genome of M, tuberculosis H37Rv. The 
number of fragments thus obtained may also be 
complemented by the determination of their length. This 
may be carried out by means of techniques well known to 
persons skilled in the art, for example on a pulsed 
filled electrophoresis gel (PFGE) . It has thus been 
possible to determine that the additional fragment 
appearing after Hindi I I digestion of the genome of 
M. Jbovis BCG- Pasteur had a size of about 2 9 kb. 

Another way of analyzing the restriction fragments 
resulting from the enzymatic digestion of the genome of 
the mycobacterium as described above consists in 
bringing said fragments into contact with at least one 
appropriate probe, covering for example the duplicated 
region, under hybridization conditions so as to then 
identify the number and size of the fragments which 
have hybridized. The probes used for this purpose may 
be labeled or nonlabeled according to techniques well 
known to persons skilled in the art. 

Thus, the probe may be obtained by amplification of the 
genomic DNA with primers chosen from the group 
SEQ ID No. 31, SEQ ID No. 32, SEQ ID No. 33 or 
SEQ ID No. 34 with the pair: 

- SEQ ID No. 31/SEQ ID No. 32 specific for DUX 

- SEQ ID No. 33/SEQ ID No. 34 specific for DU2 

It is also possible to analyze the fragments by 
carrying out amplification of the fragments obtained 
with priuiers chosen from the group SEQ ID No. 19, 



SEQ ID No. 20, SEQ ID No. 21, SEQ ID No. 22, SEQ ID No. 
23, SEQ ID No. 24, SEQ ID No. 25, SEQ ID No. 26, 
SEQ ID No. 2 7 and SEQ ID No. 2 8 with: 

- SEQ ID No. 19, SEQ ID No. 2 0/SEQ ID No . 21 
specific for JDUl 

- SEQ ID No. 22, SEQ ID No. 24/SEQ ID No, 23, 
SEQ ID No. 25 specific for JDU2A 

- SEQ ID No. 26/SEQ ID No. 27, SEQ ID No. 28 
specific for JDU2B 

It is also possible to amplify the fragments obtained 
with primers chosen from the group SEQ ID No. 35, 
SEQ ID No. 36, SEQ ID No. 37 and SEQ ID No. 38 specific 
for DUl and then to analyze them by sequencing. 

LEGEND TO THE FIGURES 

FIGURES lA to ID: Map of the BAG of Mycobacterium bovis 
BCG-Pasteur superposed on the BAG of M. tujberculosis 
H3 7RV and on the cosmid maps (these figures should be 
read from left to right and from top to bottom, figure 
lA at the top left, figure IB at the top right, figure 
IG at the bottom left and figure ID at the bottom 
right) . 

The ''X" clones correspond to the clones in pBeloBACll 
of M. jbovis BGG, the "^XE" clones correspond to the 
clones in pBAGe3 . 6 of M. bovis BCG, the '"Rv" clones 
correspond to the clones in pBeloBAGll of 
M. tuberculosis, the clones ''Y" correspond to the 
clones in the cosmid pYUB328 of M. tuberculosis and the 
^'I" clones correspond to the clones in the cosmid 
pYUB412 of M. tuberculosis. The location of each 
deletion region is shown on the map. The scale bars 
indicate the position on the genome of M. tuberculosis, 

FIGITK.ES 2A to 2F: General view of the deleted regions 
RD5-RD10 . 
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The regions deleted from the genome of M. tuberculosis 
are delimited by arrows with a sequence flanking each 
deletion. The ORFs (open reading frames) are 
represented by "directed" boxes showing the direction 
5 of transcription as described above (Cole et al . , 
1998) . The putative functions and the families of the 
ORFs are described in table 3 . The stop codons are 
indicated by small vertical bars. 

10 FIGURE 3: Detection of the RD5 deletion. 

Digestions of the Rvl43 clone of the BAG with the 
endonucleases EcoRI , PstI and S^tuI revealed that 
fragments of 1 . 5 kb (jE?coRI) , 1 . 5 kb (PstI), 1.3 and 
15 2.7 kb (StuI) show no binding with M. bovis or M. bovis 
BCG DNA probes (the absent bands are indicated by 
arrows) . The size in kilobases (kb) is indicated on the 
left. 

2 0 FIGURE 4b: The RvDl and RvD2 regions 

A. Size polymorphism in amplicons generated by flanking 
primers (i) RvDl and (ii) RvD2 . PGR reactions were 
carried out using the GeneAmp XL PGR kit (Perkin Elmer) 
25 with DNA templates of M. tuberculosis H37Rv, M. bovis 
and M. bovis BGG-Pasteur in combination with primers 
described in table 3 . The size in kilobases is 
indicated on the left of each image. 

30 B. Structure of the ORFs of the loci of RvDl and RvD2 . 
The sequence of the two loci was determined from 
M. bovis BCG Pasteur, the flanking sequence in 
M. tuberculosis H3 7Rv being shown. The putative 
functions of the ORFs are described in table 1 with 

35 vertical barriers representing the stop codons. 

FIGURE 5: Duplicated region DUl in M. bovis BCG-Pasteur 
compared with the same region in M. tuberculosis H3 7Rv<. 
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FIGURE 6: Duplicated region DU2 in M. hovis BCG-Pasteur 
compared to the same region in M, tuberculosis H3 7Rv 

The present application is not limited to the above 
5 description and will be understood more clearly in the 
light of the examples below which should not in any 
manner be considered as limiting the present invention. 

EXAMPLES 
10 1. PROCEDURES AND RESULTS 

Construction of an M, hovis BCG-Pasteur BAC library 

Recent attempts for cloning very large inserts of 
15 mycobacterial DNA (120-180 kb) into the vector 
pBeloACll have resulted in failure (Brosch et al . , 
1998) . To establish if this size determination was due 
to the vector pBeloBACll, the inventors have tested in 
parallel the vector pBACe3 . 6 from BAC which uses the 
20 selection system sacB (Lawes and Maloy, 1995; Pelicic 
et al., 1996). Ligations carried out with fragments in 
the size ranges from 50 to 125 kb gave 5 to 10 times 
fewer transf ormants in pBACe3 , 6 than the control 
ligations using pBeloBACll (clones X) . The size of an 
25 insert in the clones pBACe3 . 6 was approximately between 
the interval 40-100 kb, similar to what was observed 
for pBeloBACll. This suggests that a size of about 
12 0 kb is indeed the upper size limit for the 
feasibility of the cloning of mycobacterial DNA. 

30 

Definition of the minimum set of BCG BACs 

100 clones randomly selected from pBeloBACll and 
pBACe3.6 libraries were sequenced at the ends to 
determine their position relative to the 
35 M. tuberculosis H37Rv chromosome (Cole et al . , 1998). 
This gave a minimum network of clones on the genome but 
with a preferential group in the vicinity of the sole 
operon rrn, which was also observed during the 
construction of the M. tujbercuiosis BAC map (Brosch et 



al., 1998). To fill the holes between the positioned 
clones, PGR primers were prepared, on the basis of the 
sequence of the complete M. tuJberculosris genome, so as 
to screen the BAG pools for specific clones. Using this 
methodology, clones covering more than 98% of the 
genome were isolated and positioned on the sequence of 
the M. tuberculosis genome. 

A minimum set of 57 M. bovis BCG clones was necessary 
to cover the genome (figure 1) . 56 of these clones are 
from the library pBeloBAGll and 1 is from the library 
pBACe3,6, namely XE015 (at about 680 kb) . Because 
previous experience had shown that the M. tuberculosis 
clones based on pBeloBAGll exhibited exceptional 
stability (Brosch et al . , 1998), these clones were 
preferred to the less characterized pBAGe3 . 6 system. 
The clone XE015 represents a region for which the 
pBeloBAGll clones could not be found. Two regions of 
about 36-52 kb, covered by no clone, are located at 
about 2 66 0 kb and about 2 96 0 kb on the genome. 
Previously, the isolation of cosmids and of 
M. tuberculosis BAG clones which covered the region at 
about 2 960 kb posed problems (Brosch et al . , 1998) 
suggesting that this region could contain genes which 
are detrimental to E. coli. 

Use of BAG chips for detecting deletions in the 
M. Jbovis BGG genome 

This involves the detection, from the M. tuberculosis 
H37RV BAG library, of 63 clones covering 97% of the 
genome (Brosch et al . , 1998). Analysis in silico of the 
sequence of the M, tuberculosis genome revealed that 
the digestion of these clones with either PvuII or 
EcoRl gave rise to a reasonable number of restriction 
fragments for each clone. The digested fragments 
migrated through agarose gels, gave rise to spots on 
membranes and were then hybridized with the ^^P- labeled 
genomic DNA of M. bovis BGG and M. bovis. The 



restriction fragments which did not hybridize with the 
DNA probes were considered to be absent from the 
genomes of M, bovis or BCG. As the initial screening 
used only two enzymes, it is possible that other 
deletions passed unnoticed. However, it is probable 
that all the important deletions (> 5 kb) were detected 
by this approach. 

From an analysis of the entire genome, 10 loci were 
identified which appeared to be absent from M. bovis 
BCG compared with M. tuberculosis. Hybridizations with 
the M. bovis genomic DNA revealed that 7 of these loci 
were also deleted in M. jbovis compared with 
M. tuberculosis. Closer analysis revealed that the 
three deletions specific to M. bovis BCG were identical 
to the RD1-RD3 regions defined by the Stover team 
(Mahairas et al . , 1996). Retaining the previous 
nomenclature, the 7 M. bovis/BCG deletions were 
designated RD4, RD5, RD6, RD7, RD8 , RD9 and RDIO 
(figures 1 and 2) , Sequencing reactions using the 
corresponding BAG clones as template were used to 
define precisely the terminal regions of the deletions 
(figure 2, table 1) . 



RD4 

RD4 is a 12.7 kb deletion previously characterized as a 
region absent from M. bovis and M. bovis BCG of the 
Pasteur, Glaxo and Denmark substrains (Brosch et al . , 
1998) . Among the proteins encoded by the 11 ORFs, some 
show resemblance with the enzymes involved in the 
synthesis of the lipopolysaccharides . To determine if 
RD4 was deleted only in the bovine strains, 
M. africanum, M. microti, M. tuberculosis CSU#93 and 27 
clinical isolates of M. tuberculosis were examined for 
the presence of the locus (table 2) . PGR reactions 
using primers internal to RD4 (table 3) generated only 
products in nonbovine strains. 



RD5 

RD5 has a size of 8 964 bp located between the genomic 
positions 2626067-2635031 (figure 3, table 1). The 
region contained 8 ORFs (table 1), three of them: plcA, 
plcB and plcC, encode phospholipase C enzymes whereas 
two others encode proteins belonging to the ESAT-6 and 
QILSS families respectively (Cole et al . , 1998; 
F. Tekaia, S. Gordon, T. Garni er, R. Brosch, 

E.G. Barrel! and S,T. Cole, submitted) . ORF Rv2352c 
encodes a PPE protein which is a member of the large 
family of proteins in M. tuberculosis (Cole et al . , 
1998) . Another protein of the PPE family (Rv2352c) is 
truncated in M. bovis BCG because of the fact that one 
of the deletions of the terminal parts is situated in 
the ORF. Searches in databases revealed that a segment 
of 3 013 bp of RD5 was virtually identical to the mpt40 
locus previously described, shown by Pattaroyo et al . 
to be absent in M. bovis and M. bovis BCG (Leao et al . , 
1995) . Primers intended to amplify the internal part of 
RD5 (table 3) were used in the PGR reactions with the 
DNA derived from various tubercle bacilli. No amplicon 
was produced from M, bovis, M. bovis BCG and M, microti 
templates (table 2) , indicting that M, micoti also 
lacks a RD5 locus. 



RD6 

RD6 was mapped at the level of the insertion sequence 
IS1532, an IS element which is absent in M. microti, 
M. bovis and M. bovis BCG (Gordon et al . , 1998) (table 
1) . The delimiting of the size of the deletion was 
complicated by the presence of repeat regions directly 
flanking the IS element and requiring the use of 
primers outside the repeat region (table 3) , These 
primers amplified the products in M. Jbovis and M. bovis 
BCG which are about 5 kb smaller than the 
M. tuberculosis amplicon. Primer walking was used to 
precisely locate the junctions of deletions and 
revealed a deletion of 4 928 b in M. bovis and M. bovis 
BCG (genomic position of M.. tuberculosis 3846807- 



3841879) . Like the 1S1532 element, it was determined 
that RD6 contained two genes encoding PPE proteins 
(Rv3425 and Rv3426) and part of Rv3424c whose function 
is unknown (table 1) . 

RD7 

The RD2 deletion described in Mahairas et al . (Mahairas 
et al., 1996) was mapped in the M. tuberculosis Rv420 
clone and the results obtained by the inventors have 
suggested the existence of an additional deletion in 
M. bovis BCG which is very close to RD2 . Hybridizations 
were repeated using the M. bovis genomic DNA as probe 
since this strain contains RD2 sequences, thus 
simplifying the identification of other deleted 
fragments. This analysis (figure 2) revealed a 
12 718 bp deletion in M. jbovis BCG compared with 
M, tuberculosis, located 336 bp upstream of RD2, at 
positions 2208003-2220721 on the M. tuberculosis 
genome. The RD7 region contains 14 ORFs (table 3) . 8 of 
them (Rvl964-1971) constitutes part of the operon with 
the putative invasine gene mceS (Cole et al . , 1998). 
The ORFs Rvl968, Rvl969, Rvl971, Rvl973 and Rvl975 
could encode possible proteins exported or expressed at 
the surface since they contain putative N-terminal 
signal sequences or membrane anchoring. They are all 
members of the Mce family and have common properties 
(Tekaia et al . , submitted). Interestingly, Mce3 and 
Rvl968 contain the tripeptide ^'RGD'' or Arg-Gly-Asp, a 
motif involved in cellular attachment (Ohno, 1995; 
Relman et al . , 1989). Rvl977, which is truncated by 
RD7, encodes a protein exhibiting similarities (38.5% 
identity over 275 amino acids) with a hypothetical 
polypeptide and the PCC 6 803 strain of Synechocystis . 
PGR analysis (table 2) revealed that RD7 was present in 
30 clinical isolates of M. tuberculosis as well as in 
M. africanum and M. tuberculosis CSU#93 . The locus was 
however absent from microti, M. bovis and M. bovis 
BCG. 
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RD8 

RD8 covers a region of 5 895 bp positions on the 
genomic sequence of M. tuberulosis at 4556836-4062731. 
The deletion contains 6 ORFs (figure 2, table 1) with a 
seventh ORF: IpgQ which encodes lipoprotein truncated 
at its 5' end by the deletion. Among these 6 ORFs, 
Rv3 619c and Rv3 62 0c encode members of the ESAT-6 and 
QILSS families (Cole et al . 1998, Harboe et al . , 1996; 
F, Tekaia, et al . , submitted) and two other ORFs encode 
PE and PRE proteins. The other 2 ORFs, ephA and Rv3 618, 
encode a putative epoxide hydrolase and a monooxygenase 
respectively. PGR analysis directed against an internal 
segment of RD8 (table 2) revealed that the region was 
also deleted in the M. bo-vis and M, microti wild type. 

RD9 and RDIO 

The 2 03 0 bp deletion spanned by RD9 covers 2 ORFs, 
Rv2037c and Rv2074, which probably encode an 
oxidoreductase and an unknown protein respectively 
(table 1) . 2 additional ORFs are truncated by RD9 : 
Rv2 075c encodes a putative exported protein whereas 
cohL encodes a precorrin methyltransf erase involved in 
the synthesis of cobalamin. PGR analysis with flanking 
primers (table 3) revealed that RD9 is also present in 
M. africanum and M. microti (table 2) . RDIO is a 
1 903 bp deletion which truncates 2 ORFs, GchAl and 
Rv0223, which encode an enoyl-CoA hydratase and an 
aldehyde dehydrogenase respectively (table 1) . PGR 
reactions revealed that RDIO was absent from M. microti 
as well as from M. bovis and BGG. 

Other differences between M. tuberculosis and BGG 
Given the fact that the genomes of tubercle bacilli are 
highly conserved (Sreevatsan et al . , 1997), direct 
local comparison may be undertaken in a simple and 
targeted manner by examining the restriction enzyme 
profiles generated from M. tuberculosis and M. jbovis 
BGG BAG clones which cover the same regions. 
Comparative mapping of the region covered by the clone 



- 29 - 

X318 has identified this region as being very different 
from the corresponding M. tuberculosis clones. The data 
relating to the terminal sequences from the clone X066 
revealed that if its terminal sequence SP6 made it 
possible to position about 2 380 kb on the 
M, tuberculosis template, the terminal sequence T7 
would not generate any significant similarity with any 
sequence of H3 7Rv, indicating that one end of X066 was 
internal to the DNA segment present in BCG but absent 
from H3 7RV. Sequencing primers were used to walk along 
the BCG BAG clone X318 (figure 1) and revealed the 
insertion at the 2238724 bp position in the 
M. tuberculosis genome. Used in PGR reactions, the 
M. bovis BGG and M. bovis templates generated larger 
amplicons of about 5 kb than the product of 
M. tuberculosis H37Rv (figure 4A) . The whole insert, 
designated RvDl, was sequenced from X318 BGG. The 
insert of 5 014 bp extended the M. tuberculosis Rv2024c 
ORF by 2.8 kb and contained an additional ORF, 
RVD1-0RF2, of 954 bp (table 1, Figure 4B) . RvDl-ORFl 
can be superposed over the 5' joining point of the 
deletion and extends inside the flanking DNA. FASTA 
analysis revealed that RvDl-ORFl and 0RF2 encode 
proteins exhibiting no significant similarity with 
other proteins in databases. Extended Rv2 024c showed 
certain similarities (36.5% identity of 946 amino 
acids) with a Helicobacter pylori hypothetic protein 
(accession No. 025380). The loss of this sequence 
clearly had no consequence on the virulence of 
M. tuberculosis H37Rv since this strain is fully 
virulent in animal models. PGR analysis specific for 
the locus demonstrated its presence in several but not 
in all the clinical isolates and in all the BGG strains 
tested (table 2) . 

An ORF encoding a phospholipase, plcD, is interrupted 
by 1S6110 in M. tuberculosis H37Rv (Gole et al . , 1998). 
To determine if plcD was intact in other members of the 
tuberculous complex, primers flanking the insertion 
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site 1S6110 (table 3) were used in PGR reactions with 
M. bovis, M. hovis BCG and M. tuberculosis H37Rv. This 
revealed polymorphism at the locus plcD where the M. 
bovis and M. jbovis BCG amplicons were about 5 kb larger 
5 than the product of H37Rv (figure 4A) . This deletion of 
about 5 kb in the M, tuberculosis H3 7Rv genome compared 
with M. Jbovis BCG was called RvD2 . The sequencing of 
the M, bovis BCG BAG clone X086 revealed that RvD2 was 
positioned between bases 1987699-19890045 in the 
10 M, tuberculosis genome. The region comprises 6,5 kb and 
contains 3 ORFs encoding an unknown protein, an 
oxidoreductase and a membrane protein, and it extends 
the plcD gene in order to encode a product of 514 amino 
acids (Figure 4B, table 1) . 

15 

II. EXPERIMENTAL DATA 

Bacterial strains and plasmids 

The strains of the M. tuberculosis complex 
(Mycobacterium africanum, Mycobacterium microti, 
2 0 Mycojbacteriuin tuberculosis , Mycobacterium bovis and 
Mycobacterium bovis BCG) and substrains of M, bovis BCG 
(Danemark, Glaxo, Russe, Japonais, Pasteur and Moreau) 
were obtained from laboratory stalks (Unite de G,M.B., 
Institut Pasteur) . Mycobacterium tuberculosis CSU#93 

2 5 was received from John BELISLE, Department of 

Microbiology, Colorado State University, Fort Collins, 
CO 80523. Nonepidemic clinical isolates of 

M. tuberculosis were provided by Beate HEYM, Ambroise 
Pare hospital, 9 avenue Charles de Gaulle, 
30 92104 BOULOGNE CEDEX, FRANCE. The BAC vectors 
pBeloBACll (Kim et al . , 1996) and PBACe3 . 6 (Genbank 
accession No, U80929) were given by H. SHIZUYA, 
Department of Biology, California Institute of 
Technology, Pasadena, CA, and P. de JONG, Roswell Park 

3 5 Cancer Institute, Human Genetics Department, Buffalo, 

NY, respectively. The vectors and the derived 
recombinants were maintained in E. coli DHIOB. 
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Preparation of the genomic DNA 

The preparation of the genomic DNA in agarose cubes 
from hovis BCG Pasteur was carried out as previously- 
indicated (Philipp et al., 1996; Philipp et al . , 1996) 
5 but with two proteinase K digestions for 24 h each, 
rather than one digestion of 48 h. The cubes were 
stored in 0.2 M EDTA at 4''C and washed twice in 50 ml 
of Tris-EDTA (pH 8) /Triton X-100 (0.1%) at 4°C for 1 h, 
and then washed twice in 50 ml of a buffer of 
10 restriction enzyme Triton X-100 (0,1%) for 1 h at room 
temperature before use. 

Construction of the BAG library 

A DNA vector was prepared as previously indicated (Woo 

15 et al., 1994). Partial Hindlll and EcdRl digestions of 
the DNA in agarose, for cloning into pBeloBACll and 
pBACe3.6 respectively, and then contour-clamped 
homogeneous electric field (CHEF) migration were 
carried out as previously described (Brosch et al . , 

20 1998) . 5 zones, 50-75 kb, 75-100 kb, 100-125 kb and 
150-170 kb were excized from agarose gels and stored in 
TE at 4''C. Ligations with the vectors pBeloBACll and 
pBACeB . 6 and transformation in E. coli DHIOB were 
carried out as previously described (Brosch et al . , 

25 1998) . The pBeloBACll transf ormants were selected on LB 
agar containing 12.5 p,g/ml of chloramphenicol, 50 |ig/ml 
of X-gal and 25 |ig/ml of IPTG, and were screened with 
white recombinant colonies. The pBACe3.6 transf ormants 
were selected on LB agar containing 12.5 [xg of 

30 chloramphenicol and 5% of sucrose. The recombinant 
clones were subcultured, in duplicates, in 96-well 
microtiter plates containing a 2xYT medium with 12.5 |iig 
of chloramphenicol and were incubated overnight at 
37 °C, An equal volume of glycerol at 80% was then added 

35 to the wells and a plate was stored at -80 °C as master 
plate. The remaining plate was used to make sets of 
clones for screening purposes (see above) . 
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Preparation of DNA from recombinants and examination of 
the size of the inserts 

A recombinant carrying a DNA plasmid was prepared from 
4 0 ml of culture and was grown on the 2xYT medium 
5 containing 12 . 5 |ig of chloramphenicol as previously 
described (Brosch et al . , 1998). 100-200 ng of DNA were 
digested with Dral (Gibco-BRL) and the restriction 
products were separated on a pulsed-field 
electrophoresis gel (PFGE) with an LKB-Pharmacia CHEF 

10 apparatus using a 1% (weight /volume) and a pulse of 
4 seconds for 15 h at 6.25 V/cm. PFGE markers of 
average low size (New England Biolabs) were used as 
size standard. The sizes of the inserts were estimated 
after ethidium bromide staining and visualization with 

15 UV light. 

Sequencing reactions 

Sequencing reactions were carried out as previously 
indicated (Brosch et al . , 1998). For clones isolated 

2 0 from the pBeloBACll library, the primers SP6 and T7 

were used to sequence the ends of the inserts whereas 
for the clones pBACe3.6, the primers derived from the 
vector were used. The reactions were loaded onto 6% 
polyacryl amide gels and electrophoresis was carried out 
25 with a 373A or 377 automated DNA sequencer (Applied 
Biosystems) for 10 to 12 h. The reactions generally 
gave between 3 00 and 600 bp of readable sequences. 

BAG chips 

3 0 The overlapping clones from the pBeloBACll library of 

M. tuberculosis H37Rv (Brosh et al . , 1998) were 
selected so that 97% of the M. tuberculosis genome was 
represented. The DNA prepared from these clones was 
digested with EcoRI (Gibco-BRL) or PvuII (Gibco-BRL) 
35 and was run on 0.8% agarose gels 25 cm in length, at a 
low voltage for 12 to 16 h. After staining and 
visualization under UV, the agarose gels were treated 
by the standard Southern method and the DNAs were 
transferred onto Hybond-C Extra nitrocellulose 
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meTTibranes (Amersham) . The DNA was fixed on the membrane 
by heating at 8 0 °C for 2 h. The genomic DNA of 
M. tuberculosis H3 7Rv, Mycobacterium bovis ATCC 19210 
and M. bovis BCG Pasteur was labeled with [a-"^^?] dCTP 
5 using the Prime-It II kit (Stratagene) . The probes were 
purified on a PIO column (Biorad) before use. 
Hybridizations were carried out as previously described 
(Philipp et al . , 1996). The purified labeled probes 
were dissolved in a 5xSSC solution (IxSSC is 0.5 M 

10 sodium chloride; 0.015 M sodium citrate), and 50% 
(weight /volume) formamide. The hybridization was 
carried out at 37 °C, and the membranes were washed for 
15 min at room temperature in 2xSSC/0.1% SDS and then 
in lxSSC/0.1% SDS and finally in O.lxSSC/0.1% SDS. The 

15 results were interpreted from autoradiograms . In 
general, it was difficult to visualize on the 
autoradiograms the fragments of less than 1 kb, 
especially after repeated use of the membranes. The 
fragments larger than 1 kb gave clearer results. The 

20 clones which appeared to contain fragments with no 
counterpart in M, bovis BCG were subcultured for 
subsequent analyses. The genomic sequence allowed the 
establishment of restriction maps with the aim of 
determining the suspected regions of deletion, making 

2 5 it possible to select enzymes giving the best 

resolution of the regions. Clones could thus be 
digested with a second range of enzymes (generally PstI 
and StuI, with EcoRl included as a control) and 
hybridized in order to obtain a more accurate size of 

3 0 the deletion. The sequencing primers flanking the 

deletions were thus designated and used in the 
sequencing reactions with the corresponding BAC of 
M. bovis BCG used as template. 

3 5 PGR analysis 

The primers used in the PGR reactions are listed in 
tables 3 and 4. The reactions for expected products of 
less than 3 kb were carried out with a standard Taq 
polymerase (Boehringer Mannheim) . The reactions used 
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5 ]il of lOxPCR buffer (100 mM p-mercaptoethanol , 600 mM 
Tris-HCl, pH 8.8), 20 tnM MgCls, 170 mM (^4)2804, 5 pi 
of nucleotide mixture at 2 0 mM, 0.2 jjM of each primer, 
10-50 ng of DNA template, DMSO at 10%, 0.5 unit of Taq 
5 polymerase and sterile distilled water to 50 pi. The 
heat cycles were carried out with a PTC- 100 amplifier 
(MJ Inc.) with an initial denaturation step of 
90 seconds at 95 °C followed by 35 cycles of 30 seconds 
at 95°C, 1 min at 55°C and 2 min at 72°C. 

10 

The PGR reactions capable of giving rise to products 
greater than 3 kb were carried out using the PGR 
GeneAmp XL kit (Perkin Elmer) . The reactions were 
initiated according to the manufacturer's instructions, 

15 with 0.8 mM Mg(0Ac)2/ 0 . 2 pM of each primer and 10- 
30 ng of DNA template per reaction. The heat cycles 
were carried out at 96 ''C for 1 min, then followed by 15 
cycles in 2 stages at 94 ""C for 15 seconds and VO^'C for 
7 min, followed by 20 cycles in 2 stages at 94 °C for 15 

20 seconds and 70 °C for 8 min plus 15 seconds per cycle. 

Computer analysis 

The data relating to the sequences were transferred 
from the automated ABI373A sequencer to the Sun or 

2 5 Digital work station and edited using the TED software 
from the Staden package . The edited sequences were 
compared with the inventors' database relating to M. 
tuberculosis (H3 7Rv.dbs) to determine the relative 
positions of the terminal sequences on the sequence of 

30 the M. tuberculosis genome. With this method, a map of 
the M, jbovis BCG BAG clones was constructed using the 
M. tuberculosis H3 7Rv sequence as template. 

To make the genomic comparison, digestions in silico 
35 using restriction enzymes were carried out with the NIP 

(Nuc3,eotide Interpretation Program) software using the 
Staden package. The Display and Analysis program 

(DIANA) of the Sanger Centre, Cambridge, UK, was used 
to interpret the sequence data. 
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Accession numbers for the DNA sequences 

The nucleotide sequences which flank each RD locus in 
M. bovis BCG have been deposited in the EMBL database. 
5 The accession numbers for RD5 , RD6 , RD7 , RD8 , RD9 and 
RDIO are AJ007300, AJ131209, AJ007301, AJ131210, 
Y181604 and AJ132559, respectively. The sequences of 
RvDl and RvD2 in M. Jbovis BCG have been deposited under 
the Nos. Y18605 and U18606 respectively. 

10 

Detection of the duplicated region DUl 

DUl was the first depleted region observed when the 
bands for Jfindlll digestion of the clone X038 of the 
BCG BAC and of the clone Rvl3 of the H3 7Rv BAG were 
15 compared. The two clones X038 and Rvl3 had identical 
terminal sequences, extending from position Jfindlll - 
4 367 kb to the Hindlll site 0 027 kb (via 4411529 b) 
on the sequence of the genome of M. tuberculosis H3 7Rv 
(MTBH37RV) , spanning the replication origin. 

20 

Analysis in silica of the Hindlll restriction sites for 
the region given between 4 3 67 kb and - 0 02 7 kb 
revealed a Hindlll site at position ~ 4 404 kb. 
Consequently, digestion of these clones should show two 

25 restriction fragments plus the band specific for the 
vector at about 8 kb. That was the case for the H37Rv 
Rvl3 clone. By contrast, the clone X03 8 of the BCG BAC 
showed three bands plus the band specific for the 
vector at about 8 kb, two of them were identical to the 

30 Rvl3 scheme. The additional band has a size of about 
29 kb. Additional PFGE analyses using Dral revealed 
that X038 is indeed 29 kb longer than Rvl3 . For PGR 
screening of the BCG BAC pools using selected 
oligonucleotides, the inventors were able to identify 

35 three further clones X covering the parts of this 
genomic region in BCG: X585, X592, X703 . The terminal 
sequence and the PFGE analysis showed that each of 
these clones contains an insert of a different size. 



- 36 - 



corresponding to the three bands observed in the 
results of digestion of X038 (Figure 5) , 

The terminal sequences are: X585 4 367-4 404 kb) ; 
5 X592 4 404-4404 kb) ; X703 4 404-0 027 kb) . The 

sequences were repeated twice with the same results. 
The strange result according to which the clone X592 
has T7 and SP6 and in the same genomic region could be 
explained by duplication of this genomic region in BCG 

10 and also give information on the extent of the 
rearrangement. Additional comparative restriction 
analyses of the clones X585, X592, X703 and X038 with 
EcoRI revealed that X592 and X703 have the same 
restriction pattern with the exception of a 10 kb band 

15 present in X703 but absent from X592, On the basis of 
these results, primers were prepared for the 
amplification of the joining region where the 
duplicated DNA segment joins the unique region. 

20 PGR analysis with primers at 16.000 and at 4398.700 bp 
(SEQ ID No. 19 and 21) gave a product of an expected 
size from the clone X592 and also on the BCG-Pasteur 
genomic DNA. Sequencing of the PGR products obtained 
directly on the BAG DNA of the clone X592 revealed that 

25 the junction was indeed located at bases 
16.732/43 98.593 compared with the genomic sequence of 
H3 7RV and that this genomic rearrangement resulted in 
the truncation of the R-vSBlO and pknB genes. However, 
since this rearrangement is a tandem duplication, 

3 0 intact copies of the two genes could be present in the 
neighboring regions. PGR analysis with flanking primers 
of the Rv3920 and pknB genes confirmed this when the 
genomic DNA of BGG-Pasteur and of Af. tuberculosis H37Rv 
were used. Additional proof of the rearrangement was 

35 obtained using a PGR fragment of 500 bp spanning the 
oriC region of H3 7Rv as ^^P- labeled probe in order to 
hybridize the products of digestion of the genomic DNA 
of M. tuberculosis, M. Jbovis and M. bovis BCG-Pasteur 
under the stringent conditions previously describeci 
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(Philipp et al . , 1996), Whereas in M. bovis and M. 
tuberculosis a band having an average size of about 
3 5 kb was detected, in M. bovis BCG-Pasteur two bands 
hybridized, one of approximately 3 5 kb and the other of 
5 2 9 kb. In conclusion, DUl corresponds to a tandem 
duplication of 29668 bp which results in merodiploidy 
for the sigM-pahA region (Rv3911~Rv0013) . 

PGR analysis using primers at 16,000F (SEQ ID No. 19) 
10 or 16.500F (SEQ ID No. 20) (sense primers) and at 
4398. 770R (SEQ ID No. 21) (reverse primer) on the 
genomic DNA of various BCG strains (Pasteur, Glaxo, 
Copenhagen, Russia, Prague, Japan) have revealed that 
products were only obtained from three strains, 
15 including M. bovis BCG-Pasteur. The other three 
substrates always gave negative results despite the 
confirmation of the positive controls. 

As expected, the M. bovis and M. tuberculosis H37Rv 
20 type strains were also always negative. A summary of 
the mapping data is shown in figure 5. 

The dnaA-dnaN region is generally regarded as the 
functional replication origin in mycobacteria since 

25 after insertion into plasmids whose own replication 
origin is absent, the capacity to autonomously 
replicate is restored. Since BCG-Pasteur is diploid for 
the dnaA-dnaN region, the inventors studied whether 
differences existed between the nucleotides of the two 

3 0 copies present on the two BAC X592 and X703 clones. 
Analysis of the BAC DNA sequence using primers of 
flanking and internal regions of the intergenic 
dnaA-dnaN region revealed no difference between the two 
copies of the minimal oriC region. Furthermore, these 

35 sequences were identical to those disclosed in the 
literature for this BCG strain. This study suggests 
that the two copies of oriC ought to be functional. 



■9 
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Detection of the duplicated region DU2 

The second big genomic rearrangement observed in the 
M. bovis BCG- Pasteur chromosome was found by analyzing 
several BCG BAG clones covering a genomic region of 
5 about 200 kb (3 550-3 750 kb) . Their sizes, evaluated 
by PFGE, did not conform to those expected from the 
H3 7RV genome and data relating to the terminal 
sequences. Direct comparisons were complicated by the 
presence of an IS 6'II0 element in this region of the M. 
10 tuberculosis H3 7Rv chromosome which led to a small RvD5 
deletion. 

The terminal sequences of BAG X4 95 were both located 
around the Hindi 1 1 site at 3 5 94 kb, whereas the PFGE 

15 results showed that the clone has a size of about 
106 kb, containing three Hindi I I fragments, of about 
37.5 kb, about 37 kb and about 24 kb in addition to the 
vector. The 24 kb band was about 2 kb longer than the 
fragment corresponding to Hindlll of 22 kb in Rv403. 

2 0 This observation led to the hypothesis that the genomic 
region at around 3 594 kb must have been duplicated, 
giving rise to the introduction of a novel Hindi I I site 
at the point where the clone X495 ends. To show this, 
several primers in the chromosomal region of 3 58 9 kb 

2 5 to 3 594 kb were tested for the sequencing of the BAG 

X495 DNA and a junction (JDU2A) was identified at bases 
3690124/3590900 relative to the genomic sequence of 
H37RV. This led to an interruption of the IpdA (Rv3303) 
gene but the PGR results indicated that an intact copy 

3 0 of this gene is present in the duplicated region. 

Systematic analysis of other clones in the vicinity 
allowed the identification of 2 BAGs independent of the 
BGG (X094 and X1026) which carried the same chromosomal 
35 fragment 3 594 to 3 74 9 kb. Although the terminal 
sequence data suggested that these clones had to have a 
size of about 155 kb, the size estimated by Hindlll or 
Dral digestions followed by PFGE separation were only 
aboui; 100 kb. This difference indicated that the 
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inserts of clones X094 and X1026 probably extended from 
the repeated Hindlll sites at 3 594 kb to the authentic 
Hindlll site at position 3 749 kb, and that an internal 
deletion had taken place inside the duplicated unit. 

5 

This was confirmed by hybridization experiments under 
stringent conditions previously described on the 
genomic DNA, digested with Hindlll, of M. tuberculosis 
H37RV, M. jbovis and BCG- Pasteur using the DNA of the 

10 radiolabeled X495 clone. The size of one of the bands 
which hybridized with this DNA in the HindTTX profiles 
of M. tuberculosis H3 7Rv and M, bovis were about 22 kb, 
whereas the corresponding band in BCG was 24 kb 
exactly, which was observed with the BAG clones. 

15 Furthermore, the hybridization results showed that a 
band of 34 kb in the Hindlll profile of the X094 clone 
also hybridized with the genomic DNA of the X4 95 clone, 
which confirmed that the X094 and X1026 clones 
contained the duplicated DNA of the genomic region 

20 covered by X495. PGR reactions and the sequence of the 
DNA of the X094 BAG clone allowed the identification of 
a second joining point JDU2B at an equivalent position 
at 3 608 471/3 671 535 in M. tuberculosis H37Rv. This 
confirmed that DU2 resulted from a direct duplication 

25 of a region of 99 225 bp corresponding to the sequences 
between positions 3 590 90 0 and 3 69 0 124 in the 
M. tujberculosis H3 7Rv genome , and an internal deletion 
of 63 064 bp then took place. The residual DU2 unit is 
thus 3 6 162 bp long, which is equivalent with the 

3 0 mapping data, and BGG-Pasteur is diploid for the 
Rv3213c-Rv'323 0c and Rv3290c-Rv33 02c genes. 

Finally, experiments involving PGR, PFGE mapping and 
sequencing of the terminal sequences with BAG X094 
3 5 suggested that BGG-Pasteur contained additional DNA in 
the chromosomal region of the 3 691 to 3 74 9 kb HirzdIII 
site. Direct comparison with the M. tuberculosis 
Rv403 BAG clone allowed the detection of two additional 
Hindlll sites in this region since the Hindlll 



- 40 - 

fragments of 4 8 kb present in Rv4 03 (corresponding to 
fragment 3 691 to 3 74 9) were represented by two bands 
of 22 to 3 6 kb in BCG. This region of the 
M, tuberculosis H3 7Rv chromosome contains a copy of 
5 136110 which is not flanked by the characteristic 
direct repeat units of 3 bp. It is now clear that there 
were initially two copies of 1S6110 which served as 
substrate for a recombination event. This gave rise to 
the deletion of a segment of 4 kb of the genome of 

10 M. tuberculosis H3 7Rv (RvD5) , which is always present 
in BCG, as well as in M. bovis and the clinical 
isolates of M. tuberculosis , Analysis of the sequence 
of this region indicated that this 4 kb fragment 
contains two Hindlll sites and that there is absent 

15 therefrom the IS6110 sequence which is present at this 
site in M. tuberculosis H3 7Rv, Using internal primers 
for RvD5 (table 4) , the inventors obtained amplicons 
with the genomic DNA of all the M. bovis BCG strains 
tested, and the M. bovis strain, as well as with the 

20 DNA of clones X094 and X1026, but not with the 
M. tuberculosis H37Rv and H37Ra strains. 

Experiments with multiple sets of primers (3689. 500F 
(SEQ ID No. 22) or 3689. 900F (SEQ ID No, 24) (sense) 

25 3591. GOOR (SEQ ID No. 23), 3591, 500R (SEQ ID No. 25) or 
3592. DOOR (reverse)) to amplify the joining region at 
the level of the base 3690124/3590900 (described above) 
in various M, bovis BCG strains revealed that amplicons 
could only be obtained from M. jbovis BCG- Pasteur and 

3 0 from two other BCG substrates, whereas the other BCG 
substrates gave no amplicon. Confirmation of the 
results may be obtained on Hindlll spots hybridized 
with labeled DNA derived f rom ' the 3689500F-3690 . OOOR 
region which ought to give rise to bands with 

3 5 rearranged BCG strains, one of them has a size of about 
24 kb, about 2 kb more than the corresponding band in 
the genomic digestions of M. bovis and M. tuberculosis . 
The second band of about 3 5 kb ought to be present only 
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in the rearranged strains and not in M. tuberculosis 
H3 7RV or the M. bovis type strain (figure 6) . 

The screening of clones of 2 00 0 X and XE (Gordon et 
5 al., 1999) for BACs containing both JDU2A and JDU2B 
junctions, that is to say which cover the complete 
rearranged region allowed the identification of three 
BACs (X1070, XE377 and XE256) which produced amplicons 
with the two sets of primers. The inserts were 

10 estimated by PFGE to have a size of 95, 86 and 97 kb 
respectively. On the basis of these PGR results, data 
corresponding to the terminal sequences and the 
presence of three chromosomal Hindi II fragments of 37, 
3 6 and 24 kb, the inventors concluded that the XI 070 

15 clone overlaps the X495 clone. However, it contained a 
chromosomal Hindlll fragment of 36 kb which was neither 
present in the X4 95 clone nor in the X094 clone and, 
with the terminal sequence data, this would suggest the 
presence of a third copy of the Hindi 1 1 site at 

2 0 3 594 kb in the rearranged region. New proof of this 

was obtained when the XE2 56 and XE3 77 clones obtained 
from an £?coRI library in pBACe3.6 were analyzed. 
Depending on the terminal sequence data, XE2 56 extends 
from the EcoRI site at 3 597 kb to the EcoRl site at 
25 3 713 kb, and XE377 from the EcoRI site at 3 679 kb to 
the £;coRI site at 3 715 kb. The fact that these clones 
repeatedly gave amplicons for the two cited joining 
regions JDU2A and JDU2B was not in agreement with their 
size and their terminal sequences. However, these data 

3 0 were coherent with the fact that the region of 3 6 

162 bp of DU2 was present not only as one but rather as 
two tandem copies. Hybridization (according to the 
method of Philipp et al . , 1996) of the fragments of 
Hindi I I digested DNA of the XE2 56, X1070 and XE3 77 
35 clones with a 0 . 5 kb probe of the 3 675 kb genomic 
region confirmed the PGR results. A 24 kb fragment of 
the X1070 clone hybridized, equivalent to that of the 
X4 95 clone, and a single 3 6 kb fragment which 
corresponds to an additional copy of DU2 was also 
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present. Two fragments of 33 and 34 kb of the XE256 
clone hybridized with the probe. The 33 kb fragment 
corresponds to a region which extends from the Hindi I I 
site present in the vector adjacent to the EcoRl 
5 cloning site to the nearest Hindlll site in the 
mycobacterial insert , whereas the 34 kb fragment is 
identical to that which is also present in the X094 
clone. The 33 kb fragment partially overlapped the 
X1070 clone whereas the 34 kb Hindlll fragment was 
10 identical to that present in the X094 and XE377 clones. 

These data indicate that two tandem copies of DU2 exist 
in the BCG-Pasteur genome. This was confirmed by the 
hybridizations of the products of digestion with 
Hindi II of the genomic DNA of BCG- Pasteur , 
M, tuberculosis H37Rv and M. Jbovis since all hybridized 
with the 3 675 probe. As expected, only one band of 
22 kb was observed with M. tuberculosis and M, bovis 
whereas three bands of 24, 34 and 36 kb were detected, 
by hybridization, in the BCG-Pasteur genome. However, 
the hybridization signal for the 3 6 kb fragment was 
very weak. The fact that the 24 and 36 kb bands present 
in the BAG X1070 clone hybridized with the 3 675 probe 
with the same intensity, whereas those in the genomic 
DNA of BCG-Pasteur do not, suggests that only a 
subpopulation of the BCG-Pasteur culture contains the 
second copy of DU2 . Thus, the difference observed in 
the intensity of hybridization may reflect that the 
second copy of DU2 was only recently acquired and 
indicates variants which contain one or two copy or 
copies of DU2 probably exist in the same M. bovis BCG- 
Pasteur culture. 

Similar results were obtained with the genomic DNA 
3 5 fragments digested with Xbal from M, tuberculosis , 
M. bovis and BCG-Pasteur which hybridize with the 3 675 
probe. In the M. tuberculosis H37Rv digestion, the 
3 675 probe hybridized with a 183 kb fragment (genomic 
position 3 646 kb to 3 829 kb) . The corresponding 



M. hov-is fragment was approximately 178 kb, this 
difference in size being due to the absence of several 
insertion elements which are present only in the 183 kb 
M. tuberculosis H37Rv genomic fragment. The product of 
digestion with BCG-Pasteur Xbal contained two fragments 
of 215 and 250 kb which hybridized with the 3 675 
probe. These two fragments corresponded to the 178 kb 
fragment observed in the M. bovis genome increased by 
3 6 or 72 kb because of the presence of one or two 
copies of DU2 . It is of interest to note that the 
hybridization signal for the 2 50 kb fragment was less 
intense than the signal obtained for the 215 kb 
fragment, which confirms the previous observations with 
the products of digestion with Hindlll. 

These observations indicate that this region of the BCG 
genome is still dynamic and that a subpopulation of 
cells is triploid for the Rv3213c-Rv3230c and Rv3290c- 
Rv3302c genes. These comparative data between the 
sequence of the genome of M, tuberculosis H37Rv and of 
BCG-Pasteur indicate that BCG-Pasteur ought to be 
triploid for at least 58 genes, and that at one point 
of their evolution, their common ancestor contained 
duplicated copies of 60 additional genes which were 
lost when the deletion internal to DU2 occurred. 
Furthermore, the presence of DUl and of DU2 , and in 
particular the demonstration of the fact that DU2 is 
present in the form of two copies in a subpopulation of 
BCG-Pasteur, suggests that the tandem duplication 
process in BCG is still dynamic. 

The invention therefore provides data which may make it 
possible to compare the various BCG strains with each 
other. Moreover, the invention shows the benefit of 
using mapping strategies with BACs as complement for 
sequencing the genome and allows the identification of 
possible drawbacks of projects which are based solely 
on the sequencing of clones by the ^^shot gun 
technique''. Thus,^ without this BAG library, it is 
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highly probable that these complex genomic 
rearrangements in the M, bovis BCG strains would not 
have been detected. It is therefore an advantage of the 
present invention to provide data which allow the 
5 characterization and possibly the immunogenic and 
protective classifications of the various BCG strains 
which are currently used clinically and for vaccine 
applications, and to provide information which allow 
the specific identification of M, tuberculosis in 

10 relation to bovis and M. bovis BCG, or information 
which allow the specific identification of M, bovis BCG 
in relation to M. bovis. The present invention thus 
provides important information for the study and the 
epidemiology of tuberculosis, and for the subsequent 

15 studies of genomic rearrangements in the different 
bacteria . The technique developed in the present 
invention is exemplified by the results of the present 
invention and may be applied to other bacterial and/or 
parasite genomes . 

20 

Thus, the fact that M. bovis BCG-Pasteur and two other 
substrains of M. bovis BCG have a duplicated complement 
set of genes responsible for major processes such as, 
inter alia, cell division and signal translation, 

2 5 comprising two replication origins, is one of the 

surprising aspects revealed to the inventors by this 
approach to genetic comparisons. 

Since the biological material is subject to changes, 

3 0 and given that BCG vaccination trials highly varied 

protection results (0-80%) , it could be important to 
evaluate if this variation in the efficacy of 
protection may be partly attributed to the choice of 
the BCG substrain used. 

35 

It is therefore advisable to carry out additional 
investigations in order to determine if a correlation 
exists between genomic features and phenotypic 
variations among the various BCG substrains. 
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The BAC libraries have been deposited at the Collection 
Nationale de Culture de Microorganismes (CNCM) , 2 5 rue 
du Dr Roux, 75724 PARIS CEDEX 15, France according to 
5 the provisions of the Budapest treaty. 

BAC of M. tuberculosis H37Rv Serial Number 11945 

BAC of M. bovis BCG Serial Number 12 049 



10 
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TABLE 1: DESCRIPTION OF THE DELETIONS 



Deletions 


ORF/ 


POSITION® ON THE 


Size of 


Putative function or family 




Gene 


GENOME OF M, 
TUBERCULOSIS H37RV 


THE 
PRODUCT 






Rv2346c 


2625889-2626170 


94 aa 


ESAT-6 family 




Rv2347c 


2626224-2626517 


98 aa 


QLISS family 




Rv2348c 


2626655-2626978 


108 aa 


Unknovm 


RD5 


plcC 


2627173-2628696 


508 aa 


Phospholipase 




plcB 


2628782-2630317 


512 aa 


Phosphol ipase 




pic A 


2630538-2632073 


512 aa 


Phosphol ipase 




Rv2352c 


2632924-2634096 


3 91 aa 


PPE protein 




Rv2353c 


2634529-2635590 


354 aa 


PPE protein 




RV3425 


3842235-3842762 


176 aa 


PPE protein 


RD6 


RV3426 


3843032-3843727 


232 aa 


PPE protein 




Rv3427c 


3843884-3844636 


251 aa 


Transposase IS1532 




Rv342 8c 


3844737-3845966 


410 aa 


Transposase ISI532 




RV1964 


2207698-2208492 


265 aa 


Integral membrane 




RV1965 


2208505-2209317 


271 aa 


Integral membrane 




Mce3 


2209325-2210599 


425 aa 


Invasin-type protein, 
RGD motif 




RV1967 


2210599-2211624 


342 aa 


Exported protein 




RV1968 


2211624-2212853 


410 aa 


Exported protein, RGD 
motif 




Rvl969 


2212853-2214122 


423 aa 


Exported protein 




IprM 


2212853-2214122 


377 aa 


Lipoprotein 




RV1971 


2215255-2216565 


437 aa 


Exported protein 


RD7 


RV1972 


2216590-2217162 


191 aa 


Membrane protein 




Rvl973 


2217162-2217641 


160 aa 


Exported protein 




RV1974 


2217657-2218031 


125 aa 


Unknown 




RV1975 


2218050-2218712 


221 aa 


Exported protein 




Rvl976c 


2218845-2219249 


135 aa 


Unimown 




RV1977 


2219752-2220795 


348 aa 


Unknown, Zn binding 

signature | 
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TABLE 1 (CONTINUED) 





ephA 


4057730-4058695 


322 aa 


Epoxide hydrolase 




RV3618 


4058695-4059879 


395 aa 


Monooxygena s e 




RV3619C 


4059984-4060265 


94 aa 


ESAT-6 family 


RD8 


Rv362 0c 


4060295-4060588 


98 aa 


QLISS family 




RV3621C 


4060648-4061886 


413 aa 


PPE protein 




Rv3622c 


4061899-4062195 


99 aa 


PE protein 


IpqG 


4062524-4063243 


240 aa 


Lipoprotein 


RD9 


cobL 


2328975-2330144 


390 aa 


Precorrin methylase 


Rv2073c 


2330215-2330961 


249 aa 


Oxi dor educ t as e 


Rv2074 


2330991-2331401 


137 aa 


Unknown 


RV2075 


2331417-2332877 


487 aa 


Exported protein or 
membrane 


RDIO 


echAI 


265505-266290 


262 aa 


Enoyl-CoA hydratase 


Rv0223c 


266302-267762 


487 aa 


Aldehyde 
dehydrogenase 


RvDl 


RvDl- 
ORFl 




675 aa 


Unknown 


RvDl- 
ORF2 


_ 


318 aa 


Unknown 


Rv2024c 


_ 


1606 aa 


Unknown 


RvD2 


pi CD 


- 


514 aa 


Phospholipase 


RvD2- 
ORFl 




394 aa 


Sugar transferase 


RvD2- 
ORF2 




367 aa 


Oxidoreductase 


RvD2- 
ORF3 




945 aa 


Membrane protein 


RV1758 




143 aa 


Cutinase 



5 * As defined by Cole et al . , Nature, 1998, 393, pages 
537-544 
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TABLE 3: PGR PRIMERS 



Deletion 


Name of the 

PRIMER 


Sequence 


Expected product 

SIZE 


RD4* 


277-32F 
277-32R 


ACATGTACGAGAGACGGCATGAG 
ATCCAACACGCAGCAACCAG 


H37RV: 1031 bp 
BCG: No product 


RD5* 


1CC-B.5P 
1CC-B.3P 


GATTCCTGGACTGGCGTTG 
CCACCCAAGAAACCGCAC 


H37RV: 1623 bp 
BCG: No product 


RD6 


78-delI 
78-del2 


ACAAAATCGCCTCGTCGCC 
ACCTGTATTCGTCGTTGCTGACC 


H37RV: 8729 bp 
BCG: 3801 bp 


RD7 


v420-flankl.F 
v420-flaiik2 .R 


GGTAATCGTGGCCGACAAG 
CTTGCGGCCCAATGAATC 


H37Rv: 13068 bp 
BCG; 350 bp 


RD8* 


D8-ephA.F 
D8-ephA.R 


GTGTGATTTGGTGAGACGATG 
GTTCCTCCTGACTAATCCAGGC 


H37RV: 678 bp 
BCG: No product 




B2329.5F 
B2332.5R 


CTGCCCGTCGTGCGCGAA 
AGTGGCTCGGCACGCACA 


H37Rv: 3048 bp 
BCG: 1018 bp 


RDIO 


D10-264F 
D10-267R 


CGCGAAAGAGGTCATCTAAAC 
GATGCTCAAGCCGTGCACC 


H37Rv: 3024 bp 
BCG: 1121 bp 


RvDl 


Boli2268469,F 
Boli2269064.R 


GCGCCACAAACGTACTATCTC 
GTTTCACCGGCTGTCGTTC 


H37Rv: 595 pb 
BCG: 5595 bp 


RvD2 


28-IS6110B.5^ 
28-RHS.2 


CCACACCGCAGGATTGGCAAG 
TCGAGTGCATGAACGCAACCGAG 


H37Rv: 2007 bpt 
BCG: 7456 bp 



* = Primers internal to the deletion 

t = Size including a copy of 1S6110 not present in BCG 
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TABLE 4: PRIMERS FOR THE IDENTIFICATION OF THE 
DEPLETED SAID REGIONS 



REGION 


Name of the 

PRIMER 


SEQUENCE 


DUl JUNCTION 


TB16.0F 
TB16,5F 
TB4398 . 7R 


GAG CCA ACG ATG ATG ATG ACC 
GGT CAC GGT CGG TGT CGT C 
CAG AAC TGC AGG GGT GGT AC 


DU2A JUNCTION 


TB3689,5F 
TB3591 . OR 

TB3591.5R 


CTA GTT GTT CAG CCG CGT CTT 
ACC GGG GTG TCG GCC AGT T 
TCG CGG CCA CCG TGC GTA A 
GGC GCC TAT GAC TGA TAC CC 


DU2B JUNCTION 


TB3672 . OR 
TB3671.7R 


riAA CAH GGT CGC GGA GTC T 
TCG AGG AGG TCG AGT CCT GT 
GGG TTC ATG AGG TGC TAG GG 


DETECTION PRIMERS 
RvD5 


RvD5-intF 
RvD5-intR 


GGG TTC ACG TTC ATT ACT GTT C 
CCT GCG CTT ATC TCT AGC GG 


HYBRIDIZATION 
PROBE 
DUl 


TB4411 . OF 
TBO . 3R 


CCG GCC ACT CAC TGC CTT C 
ACG GTA GTG TCG TCG GCT TC 


HYBRIDIZATION 
PROBE 
DU2 (probe 3 675) 


TB 3 675 OF 
TB3675.5R 


CCA ACA CCG TCA ACT ACT CGA 
ATC GCA GAA CTC CGG CGA CA 


SEQUENCING OF THE 
REGION 
dnaA-dnaN 


TB1.2F 
TB1.5F 
TBI . 8F 
TB2 . 2R 


CGA TCT GAT CGC CGA CGC C 
TCC GTC AGC GCT CCA AGC G 
GTC CCC AAA CTG CAC ACC CT 
AAT CCG GAA ATC GTC AGA CCG 
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PCT/FROO/00637 



CLAIMS 

1. Nucleotide or polynucleotide sequences deleted 
from the genome of M, hovis BCG/M. Jbovis and 
5 present in the genome of M. tuberculosis or 

conversely chosen from the following ORFs and 
genes: Rv2346c, Rv2347c, Rv2348c, plcC, plcB, 
plcA, Rv2352c, Rv2353c, Rv3425, Rv3426, Rv3427c, 
Rv3428c, Rvl964, Rvl965, jnce3 , Rvl967, Rvl968, 

10 Rvl969, IprM, Rvl971, Rvl972, Rvl973, Rvl974, 

Rvl975, Rvl976c, Rvl977, ephA, Rv3618, Rv3619c, 
Rv3620c, Rv3621c, Rv3622c, iPqG, cobL, Rv2073c, 
Rv2 0 74, Rv2 0 75, echAl , Rv0223c, RvDl-ORFl, RvDl- 
0RF2, Rv2 024c, plcD, RvD2-0RFl, RvD2-ORF2, RvD2 - 

15 0RF3, Rvl758. 

2- The nucleotide or polynucleotide sequences as 
claimed in claim 1 grouped together in nucleotide 
regions RD5 to RDIO and RvDl and RvD2 according to 
2 0 the following distribution: 

- RD5: Rv2346c, Rv2347c, Rv2348c, plcC, plcB, 
plcA, Rv2352c, Rv2353c, 

- RD6: Rv3425, Rv3426, Rv3427c, Rv3428c, 

- RD7: RV1964, Rvl965, 2nce3 , Rvl967, Rvl968, 
25 Rvl969, IprM, Rvl971, Rvl972, Rvl973, Rvl974, 

Rvl975, Rvl976c, Rvl977, 

RD8: ephA, Rv3618, Rv3619c, Rv3620c, Rv3621c, 
Rv3 622c, IpqG, 

- RD9: coJbL, Rv2073c, Rv2074, Rv2075c, 
30 - RDIO: echAl, Rv0223c, 

RvDl: RvDl-ORFl, RvDl-ORF2, Rv2024c 

- RvD2 : plcD, RvD2 -ORFl , RvD2 -0RF2 , RvD2 -0RF3 , 
Rvl758 . 

35 3. A method for the discriminatory detection and 
identification of M. hovis BCG/M. hovis or 
M, tuberculosis in a biological sample, comprising 
the following steps: 
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15 



20 
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a) isolation of the DNA from the biological 
sample to be analyzed or production of a 
cDNA from the RJSFA of the biological sample, 

b) detection of the DNA sequences of the 
mycobacterium present in said biological 
sample, 

c) analysis of said sequences with the 
nucleotide and polynucleotide sequences as 
claimed in claim 1 or 2 . 

The method as claimed in claim 3, in which the 
detection of the mycobacterial DNA sequences is 
carried out using nucleotide sequences 
complementary to said DNA sequences. 

The method as claimed in claim 3 or 4, in which 
the detection of the mycobacterial DNA sequences 
is carried out by amplification of these sequences 
using primers. 



6. The method as claimed in claim 5, in which the 
primers have a nucleotide sequence chosen from the 
group comprising SEQ ID No. 3, SEQ ID No. 4, 
SEQ ID No. 5, SEQ ID No. 6, SEQ ID No. 7, 

25 SEQ ID No. 8, SEQ ID No . 9, SEQ ID No. 10, 

SEQ ID No. 11, SEQ ID No. 12, SEQ ID No. 13, 

SEQ ID No. 14, SEQ ID No. 15, SEQ ID No. 16, 

SEQ ID No. 17, and SEQ ID No. 18 with: 

the pair SEQ ID No. 3 /SEQ ID No. 4 specific for 
30 RD5, 

the pair SEQ ID No. 5/SEQ ID No. 6 specific for 
RD6, 

the pair SEQ ID No. 7/SEQ ID No. 8 specific for 
RD7, 

3 5 - the pair SEQ ID No. 9/SEQ ID No. 10 specific for 

RD8, 

the pair SEQ ID No. ll/SEQ ID No. 12 specific for 
RD9, 



the pair SEQ ID No. 13/SEQ ID No. 14 specific for 
RDIO, 

the pair SEQ ID No. 15/SEQ ID No. 16 specific for 
RvDl , and 

the pair SEQ ID No. 17/SEQ ID No. 18 specific for 
RvD2, 

The method as claimed in claim 6, in which the 
group from which the primers are chosen comprises, 
in addition, the nucleotide sequences SEQ No, 1 
and SEQ No. 2, the pair SEQ ID No. 1/SEQ ID No. 2 
being specific to RD4 . 

A method for the discriminatory detection and 
identification of M. bovis BCG/M. bovis or M. 
tuberculosis in a biological sample, comprising 
the following steps: 

a) bringing the biological sample to be analyzed 
into contact with at least one pair of primers 
as defined in claim 6 or 7, the DNA contained 
in the sample having been, where appropriate, 
made accessible to the hybridization 
beforehand, 

b) amplification of the DNA of the mycobacterium, 

c) visualization of the amplification of the DNA 
fragments . 

^ A kit for the discriminatory detection and 
identification of M. bovis BCG/M. bovis or 
M. tuberculosis in a biological sample comprising 
the following elements: 

a) at least one pair of primers as defined in 
claim 6 or 7, 

b) the reagents necessary to carry out a DNA 
amplification reaction, 

c) optionally, the necessary components which make 
it possible to vex'ify or compare the sequence 
and/or the size of the amplified fragment. 
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The use of at least one pair of primers as defined 
in claim 6 or 7 for the amplification of a DNA 
sequence from M. hovis BCG/M. bovis or M. 
tuberculosis , 

A product of expression of all or part of the 
nucleotide or polynucleotide sequences deleted 
from the genome of M. bovis BCG/M, bovis and 
present in M. tuberculosis or conversely as 
defined in claim 1. 

A method for the discriminatory detection in vitro 
of antibodies directed against M, bovis BCG/ 
M. bovis or M, tuberculosis in a biological 
sample, comprising the following steps: 

a) bringing the biological sample into contact 
with at least one product as defined in claim 
11, 

b) detecting the antigen -antibody complex formed. 

A method for the discriminatory detection of a 
vaccination with M. bovis BCG or an infection by 
M. tuJberculosis in a mammal, comprising the 
fol lowing steps : 

a) preparation of a biological sample containing 
cells, more particularly cells of the immune 
system of said mammal and more particularly 
still T cells, 

b) incubat'ion of the biological sample of step a) 
with at least one product as defined in claim 
11, 

c) detection of a cellular reaction indicating 
prior sensitization of the mammal to said 
product, in particular cell proliferation 
and/or synthesis of proteins such as gamma- 
interf eron. 

A kit for the in vitro diagnosis of an 
M. tuberculosis infection in a mammal optionally 



vaccinated beforehand with M, bovis BCG 
comprising : 

a) a product as defined in claim 11, 

b) where appropriate, the reagents for the 
constitution of the medium suitable for the 
immunological reaction, 

c) the reagents allowing the detection of the 
antigen- antibody complexes produced by the 
immunological reaction, 

d) where appropriate, a reference biological 
sample (negative control) free of antibodies 
recognized by said product, 

e) where appropriate , a reference biological 
sample (positive control) containing a 
predetermined quantity of antibodies recognized 
by said product . 

A mono- or polyclonal antibody, its chimeric 
fragments or antibodies, characterized in that 
they are capable of specifically recognizing a 
product as defined in claim 11. 

A method for the discriminatory detection of the 
presence of an antigen of M. bovis BCG/ M. bovis 
or M, tuberculosis in a biological sample 
comprising the following steps: 

a) bringing the biological sample into contact 
with an antibody as claimed in claim 15, 

b) detecting the antigen- antibody complex formed. 

A kit for the discriminatory detection of the 
presence of an antigen of M. bovis BCG/M. bovis or 
M. tuberculosis in a biological sample comprising 
the following steps: 

a) an antibody as claimed in claim 15, 

b) the reagents for constituting the medium 
suitable for the immunological reaction. 
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c) the reagents allowing the detection of the 

antigen- antibody complexes produced by the 
immunological reaction . 

5 18, An immunological composition, characterized in 

that it comprises at least one product as defined 
in claim 11. 



19. A vaccine, characterized in that it comprises at 
10 least one product as defined in claim 11 in 

combination with a pharmaceutically compatible 
vehicle and, where appropriate, one or more 
appropriate immunity adjuvants. 



15 20. A method for the discriminatory detection and 
identification of M. bovis BCG or M, tuberculosis 
in a biological sample comprising the following 
steps : 

digestion with Hindlll , of at least part of the 
2 0 genome of the mycobacterium present in a 

biological sample to be analyzed, and 
analysis of the restriction fragments thus 
obtained. 



25 21. The method as claimed in claim 20, in which the 
analysis of the restriction fragments consists in 
counting said fragments and/or in determining 
their length. 



30 22. Method of detection as claimed in either of claims 
20 and 21, in which the analysis of the 
restriction fragments consists in bringing them 
into contact with at least one probe under 
stringent hybridization conditions and in 

3 5 identifying the fragment parts or fragment 

hybridized. 



A method as claimed in claim 22, characterized in 
that the probe is obtained by amplification of the 



genomic DNA with primers chosen from the group 
SEQ ID No. 31, SEQ ID No. 32, SEQ ID No. 33 or 
SEQ ID No. 34 with the pair: 

- SEQ ID No. 31/SEQ ID No. 32 specific for DUl 

- SEQ ID No, 33/SEQ ID No, 34 specific for DU2 

Method according to claim 20, characterized in 
that amplification is carried out of the fragments 
obtained with primers chosen from the group 
SEQ ID No, 19, SEQ ID No. 20, SEQ ID No. 21, 

SEQ ID No. 22, SEQ ID No. 23, SEQ ID No. 24, 

SEQ ID No. 25, SEQ ID No. 26, SEQ ID No. 27 and 
SEQ ID No. 2 8 with: 

- SEQ ID No. 19, SEQ ID No. 2 0/SEQ ID No , 21 
specific for JDUl 

- SEQ ID No. 22, SEQ ID No. 24/SEQ ID No. 23, 
SEQ ID No. 2 5 specific for JDU2A 

- SEQ ID No. 26/SEQ ID No. 27, SEQ ID No. 28 
specific for JDU2B 

The method as claimed in claim 20, characterized 
in that the fragments obtained are amplified with 
primers chosen from the group SEQ ID No. 35, 
SEQ ID No, 36, SEQ ID No. 3 7 and SEQ ID No. 3 8 
specific for DUl and then to analyze them by 
sequencing . 
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<120> Deleted sequences in M. bovis BCG/M. bovis or M. 
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sequences and vaccines . 

<130> D18014 

<150> FR 99 03 250 
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<150> PCT/FROO/00637 
<151> 2000-03-16 

<160> 38 

<170> Patentin Vers. 2.0 

<210> 1 
<211> 24 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> Y277-32F 
<400> 1 

gacatgtacg agagacggca tgag 24 

<210> 2 
<211> 21 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> Y277-32R 
<400> 2 

aatccaacac gcagcaacca g 21 

<210> 3 
<211> 20 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> plcC-B.5P 
<400> 3 

ggattcctgg actggcgttg 20 



<210> 4 



2 



<211> 19 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> plcC-B.3P 
<400> 4 

cccacccaag aaaccgcac 19 

<210> 5 
<211> 20 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> Y78-dell 



<210> 6 
<211> 24 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> Y78-del2 
<400> 6 

aacctgtatt cgtcgttgct gacc 24 

<210> 7 
<211> 20 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> Rv420-flankl,F 
<400> 7 

tggtaatcgt ggccgacaag 20 

<210> 8 
<211> 19 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> RV420-flank2.R 



<400> 5 

acaaaaatcg cctcgtcgcc 



20 



<400> 8 

tcttgcggcc caatgaatc 



19 



<210> 
<211> 
<212> 
<213> 



Mycobacterium tuberculosis 



9 

22 
DNA 



<220> 

<223> RD8-ephA.F 



3 



<400> 9 

ggtgtgattt ggtgagacga tg 22 

<210> 10 
<211> 23 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> RD8-ephA.R 
<400> 10 

agttcctcct gactaatcca ggc 23 

<210> 11 
<211> 19 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> TB2329.5F 
<400> 11 

tctgcccgtc gtgcgcgaa 19 

<210> 12 
<211> 19 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> TB2332.5R 
<400> 12 

cagtggctcg gcacgcaca 19 

<210> 13 
<211> 22 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> RD10-264F 
<400> 13 

tcgcgaaaga ggtcatctaa ac 22 

<210> 14 
<211> 20 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DR10-267R 



<400> 14 



agatgctcaa gccgtgcacc 



<210> 15 
<211> 22 
<212> DNA 

<213> Mycobacterium bovis 
<220> 

<223> TBoli2268469.F 
<400> 15 

cgcgccacaa acgtactatc tc 

<210> 16 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis 
<220> 

<223> TBoli2269064.R 
<400> 16 

agtttcaccg gctgtcgttc 

<210> 17 
<211> 22 
<212> DNA 

<213> Mycobacterium bovis 
<220> 

<223> Y28-IS6110B.5' 
<400> 17 

cccacaccgc aggattggca ag 

<210> 18 
<211> 24 
<212> DNA 

<213> Mycobacterium bovis 
<220> 

<223> Y28-RHS.2 
<400> 18 

atcgagtgca tgaacgcaac cgag 

<210> 19 
<211> 21 
<212> DNA 

<213> Mycobacterium bovis 
<220> 

<223> TB16.0F 
<400> 19 

gagccaacga tgatgatgac c 

<210> 20 
<211> 19 



<212> DNA 

<213> Mycobacterium bovis BCG 



<220> 

<223> TB16.5F 
<400> 20 

ggtcacggtc ggtgtcgtc 

<210> 21 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB4398.7R 
<400> 21 

cagaactgca ggggtggtac 

<210> 22 
<211> 21 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3689.5 
<400> 22 

ctagttgttc agccgcgtct t 

<2ia> 23 
<211> 19 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3591.0R 
<400> 23 

accggggtgt cggccagtt 

<210> 24 
<211> 19 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3689.9F 
<400> 24 

tcgcggccac cgtgcgtaa 

<210> 25 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 



<223> TB3591.5R 



<400> 25 

ggcgcctatg actgataccc 

<210> 26 
<211> 19 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3608 . OF 
<400> 26 

gaacagggtc gcggagtct 

<210> 27 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3672.0R 
<400> 27 

tcgaggaggt cgagtcctgt 
<210> 28 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3671.7R 
<400> 28 

gggttcatga ggtgctaggg 

<210> 29 
<211> 22 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> RvD5~intF 
<400> 29 

gggttcacgt tcattactgt tc 

<210> 30 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> RvD5-intR 
<400> 30 

cctgcgctta tctctagcgg 



<210> 31 
<211> 19 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB4411.0F 
<400> 31 

ccggccactc actgccttc 

<210> 32 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB0.3R 
<400> 32 

acggtagtgt cgtcggcttc 

<210> 33 
<211> 21 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3675. OF 
<400> 33 

ccaacaccgt caactactcg a 

<210> 34 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3675.5R 
<400> 34 

atcgcagaac tccggcgaca 

<210> 35 
<211> 19 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB1.2F 
<400> 35 

cgatctgatc gccgacgcc 

<210> 36 
<211> 19 
<212> DNA 

<213> Mycobacterium bovis BCG 



<220> 

<223> TB1.5F 
<400> 36 

tccgtcagcg ctccaagcg 

<210> 37 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB1.8F 
<400> 37 

gtccccaaac tgcacaccct 

<210> 38 
<211> 21 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB2.2R 
<400> 38 

aatccggaaa tcgtcagacc g 
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SEQUENCE LISTING 
<110> INSTITUT PASTEUR 

<120> Deleted sequences in M. bovis BCG/M. bovis or M. 
tuberculosis, method for detecting mycobacteria using 
said sequences and vaccines. 

<130> D18014 

<160> 38 

<170> Patentin Vers. 2.0 

<210> 1 
<211> 24 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> Y277-32F 
<400> 1 

gacatgtacg agagacggca tgag 24 

<210> 2 
<211> 21 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> Y277-32R 
<400> 2 

aatccaacac gcagcaacca g 21 

<210> 3 
<211> 20 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> plcC-B.5P 
<400> 3 

ggattcctgg actggcgttg 2 0 



<210> 4 
<211> 39 
<212> DNA 



<213> Mycobacterium tuberculosis 



<220> 

<223> plcC-B.3P 
<400> 4 

cccacccaag aaaccgcac 

<210> 5 
<211> 20 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> Y78-dell 
<400> 5 

acaaaaatcg cctcgtcgcc 

<210> 6 
<211> 24 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> Y78-del2 
<400> 6 

aacctgtatt cgtcgttgct gacc 

<210> 7 
<211> 20 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> Rv420-f lankl .F 
<400> 7 

tggtaatcgt ggccgacaag 

<210> 8 
<211> 19 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> RV420-f lank2 .R 
<400> 8 

tcttgcggcc caatgaatc 



- 3 - 



<210> 9 
<211> 22 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> RD8-ephA.F 
<400> 9 

ggtgtgattt ggtgagacga tg 

<210> 10 
<211> 23 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> RD8-ephA.R 
<400> 10 

agttcctcct gactaatcca ggc 

<210> 11 
<211> 19 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> TB2329.5F 
<400> 11 

tctgcccgtc gtgcgcgaa 

<210> 12 
<211> 19 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> TB2332.5R 
<400> 12 

cagtggctcg gcacgcaca 

<210> 13 
<211> 22 
<212> DNA 

<213> Mycobacterium tuberculosis 



- 4 - 



<400> 13 

tcgcgaaaga ggtcatctaa ac 

<210> 14 
<211> 20 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DR10-267R 
<400> 14 

agatgctcaa gccgtgcacc 

<210> 15 
<211> 22 
<212> DNA 

<213> Mycobacterium bovis 
<220> 

<223> TBoli2268469.F 
<400> 15 

cgcgccacaa acgtactatc tc 

<210> 16 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis 
<220> 

<223> TBoli2269064 .R 
<400> 16 

agtttcaccg gctgtcgttc 

<210> 17 
<211> 22 
<212> DNA 

<213> Mycobacterium bovis 
<220> 

<223> Y28-IS6110B, 5' 
<400> 17 

cccacaccgc aggattggca ag 

<210> 18 
<211> 24 
<2:i2> DNA 



- 5 - 



<220> 

<223> Y28-RHS.2 
<400> 18 

atcgagtgca tgaacgcaac cgag 

<210> 19 
<211> 21 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB16.0F 
<400> 19 

gagccaacga tgatgatgac c 

<210> 20 
<211> 19 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB16.5F 
<400> 20 

ggtcacggtc ggtgtcgtc 

<210> 21 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB4398.7R 
<400> 21 

cagaactgca ggggtggtac 

<210> 22 
<211> 21 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3689.5 
<400> 22 

ctagttgttc agccgcgtct t 

<210> 23 
<211> 19 



- 6 - 

<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3591.0R 
<400> 23 

accggggtgt cggccagtt 19 

<210> 24 
<211> 19 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3689.9F 
<400> 24 

tcgcggccac cgtgcgtaa 19 

<210> 25 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3591.5R 
<400> 25 

ggcgcctatg actgataccc 2 0 

<210> 26 
<211> 19 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3608.0F 
<400> 26 

gaacagggtc gcggagtct 19 

<210> 27 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3672 . OR 



<400> 27 

t c g agg agg t c g ag t c c t g t 



20 



- 7 - 



<210> 28 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB3671.7R 
<400> 28 

gggttcatga ggtgctaggg 2 0 

<210> 29 
<211> 22 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223>, RvD5-intF 
<400> 29 

gggttcacgt tcattactgt tc 22 

<210> 30 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> RvD5-intR 
<400> 30 

cctgcgctta tctctagcgg 2 0 

<210> 31 
<211> 19 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB4411.0F 
<400> 31 

ccggccactc actgccttc , 19 

<210> 32 

<211> 20 

<212> DNA 

<213> Mycobacterium bovis BCG 



<220> 

<?23> TB0,3R 



- 8 - 

<400> 32 

acggtagtgt cgtcggcttc 2 0 

<210> 33 
<211> 21 
<212> DNA 

<213> Mycobacterium bovis BCG 



<220> 

<223> TB3675.0F 



<400> 33 

ccaacaccgt caactactcg a 21 

<210> 34 
<211> 20 
<212> DNA 

<213> Mycobacterium bovis BCG 



<220> 

<223> TB3675.5R 



<400> 34 

atcgcagaac tccggcgaca 20 

210> 35 
<211> 19 
<212> DNA 

<213> Mycobacterium bovis BCG 



<220> 

<223> TB1.2F 



<400> 35 

cgatctgatc gccgacgcc 19 

210> 36 
<211> 19 
<212> DNA 

<213> Mycobacterium bovis BCG 



<220> 

<223> TB1.5F 



<400> 36 

tccgtcagcg ctccaagcg 19 

210> 37 
<211> 20 
<212> DNA 

<213> Mycob^^f:! .-xium. bovis BCG 



- 9 - 



<220> 
<223> 



TB1.8F 



<400> 
gtccc 



37 

caaac tgcacaccct 



210> 38 
<211> 21 
<212> DNA 

<213> Mycobacterium bovis BCG 
<220> 

<223> TB2.2R 
<400> 38 

aatccggaaa tcgtcagacc g 21 
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